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PCT/US96/15524 



PRODUCTION OF INFECTIOUS RESPIRATORY SYNCYTIAL 
VIRUS FROM CLONED NUCLEOTIDE SEQUENCES 



Background of the Invention 



Human respiratory syncytial virus (RSV) is the most 
important pediatric respiratory pathogen worldwide. This 
ubiquitous, highly infectious agent emerges each year in 
seasonal epidemics. Nearly everyone is infected at least once 
within the first two years of life. RSV disease is 
responsible for considerable morbidity and mortality, 
especially in the very young; in the United States it causes 
an estimated 91,000 hospitalizations and 4500 deaths annually, 
and its impact is much greater in less affluent countries. 
RSV also has come to be recognized as an important agent of 
disease of immunocompromised adults and of the elderly. 

Resistance to RSV reinfection induced by natural 
infection is incomplete but increases incrementally with 
repeated exposure. Thus, RSV can infect multiple times during 
childhood and life, but serious disease usually is limited to 
the first and sometimes second infections of life. The 
minimum goal of RSV immunoprophylaxis is to induce sufficient 
resistance to prevent serious disease associated with the 
initial infections . 

A number of attenuated RSV strains were developed 
and evaluated as vaccines during the 1960's and 70' s, but they 
were found to be either over- or under-attenuated, and in some 
cases exhibited genetic instability, as is common for single- 
stranded RNA viruses. Current strategies under investigation 
for RSV vaccine development are principally the parenteral 
administration of purified viral antigen or the development of 
live attenuated RSV for intranasal administration. The 
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intranasal route provides direct stimulation of local 
immunity. It also partially abrogates the immunosuppressive 
effects of RSV-specific maternally derived serum antibodies, 
which typically are found in the very young. The parenteral 
administration of inactivated RSV or purified RSV antigen in 
experimental animals appears to be associated with enhanced 
immunopathology upon subsequent virus challenge, similar to 
the enhanced RSV disease associated with a formalin- 
inactivated vaccine evaluated in the 1960's. But this effect 
has never been observed with RSV infection of the respiratory 
tract, suggesting that live attenuated viruses have an 
important advantage in safety. To date, however, there is no 
approved vaccine or highly effective antiviral therapy for 
RSV. 

Research efforts to produce a suitable RSV vaccine 
are impeded by poor viral growth in tissue culture, a lengthy 
replication cycle, virion instability, a negative-sense RNA 
genome, and a complex genome organization and gene products. 
RSV is a member of the pneumovirus genus of the paramyxovirus 
family, and its genome of single -stranded negative- sense RNA 
of 15,222 nucleotides has been sequenced completely for wild- 
type strain A2 virus as well as for an attenuated derivative 
thereof . 

Some aspects of RNA synthesis by RSV appear to 
follow the general pattern of nonsegmented negative strand 
viruses. The genome template is tightly encapsidated with the 
major nucleocapsid (N) protein and is associated with the 
phosphoprotein (P) and large (L) polymerase subunit protein. 
Transcription begins at the 3' extragenic leader region and 
proceeds along the entire length by a sequential, stop- start 
mechanism guided by short template signals flanking the genes. 
This yields at least ten major species of mRNA which encode at 
least ten major proteins. RNA replication occurs by a switch 
to the synthesis of a full length positive-sense "antigenome" 
which also is tightly encapsidated and serves as the template 
for the synthesis of progeny genome. 
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The viral genomic RNA of negative -strand viruses is 
not infectious alone as free RNA. In virions or 
intracellularly, viral RNA is always found tightly 
encapsidated in a ribonucleoprotein core. This nucleocapsid 
contains the viral proteins necessary for transcription and 
replication and has long been regarded as the minimum unit of 
infectivity (Brown et al . , J. Virol. 1: 368-373 (1967)). 
Thus, it has been recognized that the generation of 
biologically active synthetic viral RNA from cDNA will require 
complementation by viral protein, leading to the assembly of 
functional nucleocapsids (Collins et al . , Proc. Natl. Acad. 
Sci. USA 88: 9663-9667 (1991), and Collins et al . , Virology 
195: 252-256 (1993)). The ability to produce live RSV from 
cDNA is of particular importance because it would permit the 
introduction of specific engineered changes, including 
attenuating mutations, into the genome of infectious virus in 
an effort to produce safe and effective RSV vaccines. 

Short, internally-deleted analogs of genome or 
antigenome RNA ( "minigenomes" ) have been shown to participate 
in transcription and replication when synthesized 
intracellularly in the presence of the appropriate viral 
proteins. For two rhabdoviruses , rabies and vesicular 
stomatitis viruses, infectious virus has been produced by 
coexpression of a complete cDNA-encoded antigenome RNA in the 
presence of the N, P and L proteins (Schnell et al . , EMBO J. 
13: 4195-4203 (1994) and Lawson et al., Proc. Natl. Acad. Sci. 
USA 92: 4477-4481 (1995)). 

RSV possesses a number of properties which 
distinguishes it and other members of the genus Pneimovxrus 
from the better characterized paramyxoviruses of the genera 
Paramyxovirus, Rubulavirus and Morbillivirus . These 
differences include a greater number of mRNAs, an unusual gene 
order at the 3' end of the genome, species- to- species 
variability in the order of the glycoprotein and M2 genes, a 
greater diversity in intergenic regions, an attachment protein 
that exhibits mucin-like characteristics, extensive strain-to- 
strain sequence diversity, and several proteins not found in 
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any or most of the other nonsegmented negative strand RNA 
viruses . 

RSV remains the most common cause of severe viral 
lower respiratory tract disease in infants and children. 
Consequently, an urgent need remains for the ability to 
engineer a safe and effective vaccine that is able to prevent 
the serious illness in this population that often requires 
hospitalization. Quite surprisingly, the present invention 
fulfills this and other related needs by providing methods for 
introducing defined, predetermined changes into infectious 
RSV. 



Summarv of the Invention 

The present invention provides an isolated 
infectious RSV particle which comprises a recombinant RSV 
genome or antigenome, a nucleocapsid (N) protein, a 
nucleocapsid phosphoprotein (P) , a large (L) polymerase 
protein, and an RNA polymerase elongation factor. The RNA 
polymerase elongation factor can be M2 (ORFl) of RSV. The 
isolated infectious RSV particle can be a viral or subviral 
particle. The isolated infectious RSV virus may be a human 
RSV, a bovine or murine RSV, or the genome or antigenome can 
be a chimera of two or more different RSV genomes, such as 
having nucleotide segments from human and bovine RSV. 

In other embodiments the invention provides a 
method for producing an infectious RSV particle from one or 
more isolated polynucleotide molecules encoding an RSV. An 
expression vector which comprises an isolated polynucleotide 
molecule encoding a RSV genome or antigenome and an expression 
vector which comprises one or more isolated polynucleotide 
molecules that encodes N, P, L and RNA polymerase elongation 
factor proteins are coexpressed in a cell or cell-free lysate, 
thereby producing an infectious RSV particle. The RSV genome 
or antigenome and the N, P, L and RNA polymerase elongation 
factor proteins can be coexpressed by the same or different 
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expression vectors. In some instances the N, P, L and RNA 
polymerase elongation factor proteins are each encoded on 
different expression vectors. The polynucleotide molecule 
encoding the RSV genome or antigenome is from a human, bovine 
or murine RSV sequence, and can be a chimera of a human RSV 
strain sequence and at least one non-human RSV sequence, or 
can encodes the genome or antigenome of a wild- type RSV 
strain. The RSV genome or antigenome can be modified from a 
wild- type RSV strain by a nucleotide insertion, rearrangement, 
deletion or substitution, so as to encode a phenotypic 
alteration such as one that results in attenuation, 
temperature-sensitivity, cold-adaptation, small plaque size, 
host range restriction, or a change in an immunogenic epitope 
of RSV. The polynucleotide can encode a genome or antigenome 
of a nonhuman RSV virus, or can be a chimera of a nonhuman RSV 
and at least one other RSV or human or nonhuman origin. The 
polynucleotide molecule encoding the genome or antigenome can 
also be modified to include a nucleotide sequence that encodes 
a cytokine, a T-helper epitope, a G protein of a different RSV 
subgroup, a restriction site marker, or a protein of a 
microbial pathogen (e.g., virus, bacterium or fungus) capable 
of eliciting a protective immune response in the intended 
host . 

In other embodiments the invention provides a cell 
or cell-free lysate containing an expression vector which 
comprises an isolated polynucleotide molecule encoding a RSV 
genome or antigenome and an expression vector which comprises 
one or more isolated polynucleotide molecules that encodes N, 
F, L and RNA polymerase elongation factor proteins of RSV. 
Upon expression the genome or antigenome and N, P, L, and RNA 
polymerase elongation factor proteins combine to produce an 
infectious RSV particle, such as viral or subviral particle. 

In another aspect the invention provides an 
isolated polynucleotide molecule which comprises an operably 
linked transcriptional promoter, a polynucleotide sequence 
encoding an RSV genome or antigenome, and a transcriptional 
terminator. The RSV genome or antigenome can be a human RSV 
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sequence and modified versions thereof, such as that 
exemplified in SEQ ID NO : 1 (which depicts the 5' to 3 ' 
positive-sense sequence whereas the genome itself is negative- 
sense) . The polynucleotide can also encodes a genome or 
antigenome of a nonhuman RSV virus, or encode a chimera of a 
nonhuman RSV and at least one other RSV of human or nonhuman 
origin. 

Brief Description of the Drawings 

Fig. lA and IB show the construction of cDNA 
encoding RSV antigenome RNA, where Fig. lA shows the 
structures of the cDNA and the encoded antigenome RNA (not to 
scale) . The diagram of the antigenome includes the following 
features: the 5' -terminal nonviral G triplet contributed by 
the T7 promoter, the four sequence markers at positions 109 9 
(which adds one nt to the length) , 1139, 5611, and 7559 
(numbering referring to the first base of the new restriction 
site) , the ribozyme and tandem T7 terminators, and the single 
nonviral 3 ' -phosphorylated U residue contributed to the 3' end 
by ribozyme cleavage (the site of cleavage is indicated with 
an arrow) . Cloned cDNA segments representing in aggregate the 
complete antigenome are also shown. The box illustrates the 
removal of the BamHI site, a modification that facilitated 
assembly: the naturally occurring BamHI -Sail fragment (the 
BamHI site is shown in the top line in positive sense, 
underlined) was replaced with a PCR-generated Bglll-Sall 
fragment (the Bglll site is shown in the bottom line, 
underlined; its 4-nt sticky end [shown in italics] is 
compatible with that of BamHI) . This resulted in a single nt 
change (middle line, underlined) which was silent at the amino 
acid level . 

Fig. IB shows the sequence markers contained in the 
cDNA-encoded antigenome RNA, where sequences are positive 
sense and numbered relative to the first nt of the leader 
region complement as 1; identities between strains A2 and 
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18537, representing subgroups A and B, respectively, are 
indicated with dots; sequences representing restriction sites 
in the cDNA are underlined; gene-start (GS) and gene-end (GE) 
transcription signals are boxed; the initiation codon of the N 
translational open reading frame at position 1141 is 
italicized, and the sequence markers are shown underneath each 
sequence. In the top sequence, a single C residue was 
inserted at position 1099 to create an Aflll site in the 
NSII-N intergenic region, and the AG at positions 113 9 and 
1140 immediately upstream of the N translational open reading 
frame were replaced with CC to create a new Ncol site. In the 
middle sequence, substitution of G and U at positions 5612 and 
5616, respectively, created a new StuI site in the G-F 
intergenic region. In the bottom sequence, a C replacement at 
position 7560 created a new SphI site in the F-M2 intergenic 
region . 

Fig. 2 shows construction of D46/1024CAT cDNA 
encoding an RSV antigenome containing the CAT ORF flanked by 
RSV transcription signals (not to scale, RSV-specific segments 
are shown as filled boxes and CAT sequence as an open box) . 
The source of the CAT gene transcription cassette was RSV-CAT 
minigenome cDNA 6196 (diagram at top) . The RSV-CAT minigenome 
contains the leader region, gene-start (GS) and gene-end (GE) 
signals, noncoding (NC) RSV gene sequences, and the CAT ORF, 
with Xmal restriction endonuclease sites preceding the GS 
signal and following the GE signal. The nucleotide lengths of 
these elements are indicated, and the sequences (positive- 
sense) surrounding the Xmal sites are shown above the diagram. 
A 8-nucleotide XmaJ linker was inserted into StuI site of the 
parental plasmid D46 to construct the plasmid D46/1024. The 
Xmal - XmaJ fragment of the plasmid 6196 was inserted into the 
plasmid D46/1024 to construct the plasmid D46/1024CAT. The 
RNA encoded by the D4 6 cDNA is shown at the bottom, including 
the three 5' -terminal nonviral G residues contributed by the 
T7 promoter and the 3' -terminal phosphorylated U residue 
contributed by cleavage of the hammerhead ribozyme; the 
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nucleotide lengths do not include these nonviral nucleotides. 
The L gene is drawn offset to indicate the gene overlap. 

5 Description of the Specific Embodiments 

The present invention provides the production of 
infectious RSV from cDNA. Infectious RSV is produced by the 
intracellular coexpression of a cDNA that encodes the RSV 

10 genome or antigenome RNA, together with those viral proteins 
necessary to generate a transcribing, replicating 
nucleocapsid, preferably one or more sequences that encode 
major nucleocapsid (N or NP) protein, nucleocapsid 
phosphoprotein (P) , large (L) polymerase protein, and an 

15 M2(ORFl) protein. Infectious RSV particles are produced by 
the recombinant system. The recombinant production system 
permits the introduction of defined changes into infectious 
RSV, which changes are useful in a wide variety of 
applications such as: the development of live attenuated 

20 vaccine strains bearing predetermined, defined attenuating 

mutations; analysis of RSV molecular biology and pathogenesis 
using, e.g., defined mutations to alter functions or 
expression of RSV proteins; improvement in the growth in 
culture; identification of attenuating mutations in existing 

25 or future vaccine strains by distinguishing between silent 
incidental mutations versus those responsible for phenotype 
differences; production of modified vaccine virus to 
accommodate antigenic drift; enhancement of vaccine 
immunogenicity ; ablation of epitopes associated with 

30 undesirable immunopathology ; insertion of foreign genes, in 

whole or in part, encoding protective antigens to generate RSV 
strains capable of inducing immunity to both RSV and the virus 
or agent from which the protective antigen was derived; 
insertion of foreign genes, in whole or in part, encoding 

35 modulators of the immune system such as cytokines or T cell 

epitopes, to enhance the immunogenicity of the vaccine virus; 
etc . 



I 



wo 97/12032 PCT/US96/15524 

9 

According to the present invention cDNA encoding a 
RSV genome or antigenotne is constructed for intracellular or 
in vitro coexpression with the necessary viral proteins to 
form infectious RSV. By "RSV antigenome" is meant an isolated 
5 positive-sense polynucleotide molecule which serves as the 

template for the synthesis of progeny RSV genome. Preferably 
a cDNA is constructed which is a positive-sense version of the 
RSV genome, corresponding to the replicative intermediate RNA, 
or antigenome, so as to minimize the possibility of 
10 hybridizing with positive-sense transcripts of the 

complementing sequences that encode proteins necessary to 
generate a transcribing, replicating nucleocapsid, i.e., 
sequences that encode N, P, L and M2 (ORFl) protein. In an RSV 
minigenome system, genome and antigenome were equally active 
15 in rescue, whether complemented by RSV or by plasmids, 

indicating that either genome or antigenome can be used and 
thus the choice can be made on methodologic or other grounds. 

A native RSV genome typically comprises a negative- 
sense polynucleotide molecule which, through complementary 
20 viral mRNAs, encodes eleven species of viral proteins, i.e., 

the nonstructural species NSl and NS2, N, P, matrix (M) . small 
hydrophobic (SH) , glycoprotein (G) , fusion (F) , M2 (ORFl) , 
M2(ORF2), and L, substantially as described in Mink et al., 
Virology 185: 615-624 (1991), Stec et al . , Virology 183: 273- 
25 287 (1991), and Connors et al . , Virol ■ 208:478-484 (1995), 

incorporated herein by reference. For purposes of the present 
invention the genome or antigenome of the recombinant RSV of 
the invention need only contain those genes or portions 
thereof necessary to render the viral or subviral particles 
30 encoded thereby infectious. Further, the genes or portions 
thereof may be provided by more than one polynucleotide 
molecule, i.e., a gene may be provided by complementation or 
the li)ce from a separate nucleotide molecule. 

By recombinant RSV is meant a RSV or RSV- like viral 
35 or subviral particle derived directly or indirectly from a 
recombinant expression system or propagated from virus or 
subviral particles produced therefrom. The recombinant 
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expression system will employ a recombinant expression vector 
which comprises an operably linked transcriptional unit 
comprising an assembly of at least a genetic element or 
elements having a regulatory role in RSV gene expression, for 
5 example, a promoter, a structural or coding sequence which is 
transcribed into RSV RNA, and appropriate transcription 
initiation and termination sequences. 

To produce infectious RSV from cDNA-expressed 
genome or antigenome, the genome or antigenome is coexpressed 

10 with those RSV proteins necessary to (i) produce a 

nucleocapsid capable of RNA replication, and (ii) render 
progeny nucleocapsids competent for both RNA replication and 
transcription. Transcription by the genome nucleocapsid 
provides the other RSV proteins and initiates a productive 

15 infection. Alternatively, additional RSV proteins needed for 
a productive infection can be supplied by coexpression . 

An RSV antigenome may be constructed for use in the 
present invention by, e.g., assembling cloned cDNA segments, 
representing in aggregate the complete antigenome, by 

20 polymerase chain reaction (PGR; described in, e.g., U.S. 

Patent Nos . 4,683,195 and 4,683,202, and PCR Protocols: A 
Guide to Methods and Applications^ Innis et al . , eds . , 
Academic Press, San Diego (1990) , incorporated herein by 
reference) of reverse-transcribed copies of RSV mRNA or genome 

25 RNA. For example, cDNAs containing the lefthand end of the 
antigenome, spanning from an appropriate promoter (e.g., T7 
RNA polymerase promoter) and the leader region complement to 
the SH gene, are assembled in an appropriate expression 
vector, such as a plasmid (e.g., pBR322) or various available 

3 0 cosmid, phage, or DNA virus vectors. The vector may be 
modified by mutagenesis and/or insertion of synthetic 
polylinker containing unique restriction sites designed to 
facilitate assembly. For example, a plasmid vector described 
herein was derived from pBR322 by replacement of the Pstl- 

3 5 EcoRl fragment with a synthetic DNA containing convenient 

restriction enzyme sites. pBR322 stabilized nucleotides 3716- 
3732 of the RSV sequence, which otherwise sustained nucleotide 
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deletions or insertions, and propagation of the plasmid was in 
bacterial strain DHIOB to avoid an artifactual duplication and 
insertion which otherwise occurred in the vicinity of nt4499. 
For ease of preparation the G, F and M2 genes can be assembled 
5 in a separate vector, as are the L and trailer sequences. The 
righthand end (e.g., L and trailer sequences) of the 
antigenome plasmid may contain additional sequences as 
desired, such as a flanking ribozyme and tandem T7 
transcriptional terminators. The ribozyme can be hammerhead 

10 type (e.g., Grosfeld et al . , J. Virol. 69:5677-5686 (1995)), 
which would yield a 3' end containing a single nonviral 
nucleotide, or can any of the other suitable ribozymes such as 
that of hepatitis delta virus (Perrotta et al . , Nature 
350:434-436 (1991)) which would yield a 3' end free of non-RSV 

15 nucleotides. A middle segment (e.g., G-to-M2 piece) is 
inserted into an appropriate restriction site of the 
leader-to-SH plasmid, which in turn is the recipient for the 
L- trailer-ribozyme-terminator piece, yielding a complete 
antigenome. In an illustrative example shown in Fig. lA, the 

20 leader end was constructed to abut the promoter for T7 RNA 
polymerase which included three transcribed G residues for 
optimal activity; transcription donates these three nonviral 
G's to the 5' end of the antigenome. These three nonviral G 
residues can be omitted to yield a 5' end free of nonviral 

25 nucleotides. To generate a nearly-correct 3' end, the trailer 
end was constructed to be adjacent to a hammerhead ribozyme, 
which upon cleavage would donate a single 3 ' -phosphorylated U 
residue to the 3' end of the encoded RNA. 

A variety of nucleotide insertions and deletions 

30 can be made in the RSV genome or antigenome. The nucleotide 
length of the genome of wild type human RSV (15,222 
nucleotides) is a multiple of six, and members of the 
Paramyxovirus and Morbillivirus genera typically abide by a 
"rule of six," i.e., genomes (or minigenomes) replicate 

35 efficiently only when their nucleotide length is a multiple of 
six (thought to be a requirement for precise spacing of 
nucleotide residues relative to encapsidating NP protein) . 
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Alteration of RSV genome length by single residue increments 
had no effect on the efficiency of replication, and sequence 
analysis of several different minigenome mutants following 
passage showed that the length differences were maintained 
5 without compensatory changes. Thus, RSV lacks the strict 
requirement of genome length being a multiple of six, and 
nucleotide insertions and deletions can be made in the RSV 
genome or antigenome without defeating replication of the 
recombinant RSV of the present invention. 

10 Alternative means to construct cDNA encoding the 

genome or antigenome include by reverse transcription-PCR 
using improved PGR conditions (e.g., as described in Cheng et 
al,, Proc. Natl. Acad. Sci. USA 91:5695-5699 (1994)), 
incorporated herein by reference) to reduce the number of 

15 subunit cDNA components to as few as one or two pieces. In 
other embodiments different promoters can be used (e.g., TS, 
SP6) or different ribozymes (e.g., that of hepatitis delta 
virus. Different DNA vectors (e.g., cosmids) can be used for 
propagation to better accommodate the larger size genome or 

2 0 an t igenome . 

By virtue of the present invention a variety of 
alterations in the RSV genome or antigenome for incorporation 
into infectious recombinant RSV are made possible . For 
example, foreign genes may be inserted, the order of genes 

2 5 changed, gene overlap removed, the RSV genome promoter 

replaced with its antigenome counterpart, portions of genes 
removed (e.g., the cytoplasmic tails of glycoprotein genes), 
and even entire genes deleted. Modifications in the sequence 
can be made to facilitate manipulations, such as the insertion 

30 of unique restriction sites in various intergenic regions 
(e.g., a unique Stul site between the G and F genes) or 
elsewhere. Nontranslated gene sequences can be removed to 
increase capacity for inserting foreign sequences. 

The infectious RSV produced from cDNA- expressed 

35 genome or antigenome can be any of the RSV or RSV-like 
strains, e.g., human, bovine, murine, etc., or of any 
pneumovirus, e.g., pneumonia virus of mice or turkey 
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rhinotracheitis virus. To engender a protective immune 
response, the RSV strain may be one which is endogenous to the 
subject being immunized, such as human RSV being used to 
immunize humans. The genome or antigenome can be modified, 
however, to express RSV genes from different types. Thus, 
infectious RSV intended for administration to humans may be 
human RSV that has been modified to contain genes from a 
bovine or murine RSV type such as for the purpose of 
attenuation, or a bovine RSV may be modified to contain genes 
that encode epitopes or proteins that elicit protection 
against human RSV infection, e.g., the human RSV glycoprotein 
genes can be substituted for the bovine glycoprotein genes 
such that the bovine RSV. which has a restricted ability to 
replicate in a human host, elicits a protective immune 
response in humans against human RSV strains. 

The N, P and L proteins, necessary for RNA 
replication, require an RNA polymerase elongation factor such 
as the M2(ORFl) protein for processive transcription. Thus 
M2(ORFl) or a substantially equivalent transcription 
elongation factor for negative strand RNA viruses is required 
for the production of infectious RSV and is a necessary 
component of functional nucleocapsids during productive 
infection. The need for the M2(ORFl) protein is consistent 
with its role as a transcription elongation factor. The need 
for expression of the RNA polymerase elongation factor protein 
for negative strand RNA viruses is a feature of the present 
invention. M2 (ORFl) can be supplied by expression of the 
complete M2-gene, although in this form the second ORF2 may 
also be expressed and have an inhibitory effect on RNA 
replication. Therefore, for production of infectious virus 
using the complete M2 gene the activities of the two ORFs 
should be balanced to permit sufficient expression of M(ORFl) 
to provide transcription elongation activity yet not so much 
of M(ORF2) to inhibit RNA replication. Alternatively, the 
ORFl protein is provided from a cDNA engineered to lack ORF2 
or which encodes a defective ORF2 . Efficiency of virus 
production may also be improved by co-expression of additional 



wo 97/1 2032 PCTAJS96/1 5524 

14 

viral protein genes, such as those encoding envelope 
constituents (i.e., SH, M, G, F proteins). 

Isolated polynucleotides (e.g., cDNA) encoding the 
genome or antigenome and, separately, the N, P, L and M2(ORFl) 
proteins, are inserted by transf ection, electroporation, 
mechanical insertion, transduction or the like, into cells 
which are capable of supporting a productive RSV infection, 
e.g., HEp-2, FRhL-DBS2, MRC, and Vero cells. Transfection of 
isolated polynucleotide sequences may be introduced into 
cultured cells by, for example, calcium phosphate-mediated 
transfection (Wigler et al . , Cell 14: 725 (1978); Corsaro and 
Pearson, Somatic Cell Genetics 7: 603 (1981) ; Graham and Van 
der Eb, Viroloov 52: 456 (1973)), electroporation (Neumann et 
al., EMBO J . 1: 841-845 (1982)), DEAE-dext ran mediated 
transfection (Ausubel et al . , (ed. ) Current Protocols in 
Molecula-r Biolocrv , John Wiley and Sons, Inc., NY (1987), 
incorporated herein by reference) , cationic lipid-mediated 
transfection (Hawley-Nelson et al . , Focus 15: 73-79 (1993)) or 
a commercially available transfection regent, e.g., 
LipofectACE® (Life Technologies) . The N, P, L and M2(ORFl) 
proteins are encoded by one or more expression vectors which 
can be the same or separate from that which encodes the genome 
or antigenome, and various combinations thereof. Additional 
proteins may be included as desired, encoded on a its own 
vector or on a vector encoding a N, P, L, or M2(0RF1) protein 
or the complete genome or antigenome. Expression of the 
genome or antigenome and proteins from transf ected plasmids 
can be achieved, for example, by each cDNA being under the 
control of a promoter for T7 RNA polymerase, which in turn is 
supplied by infection, transfection or transduction with an 
expression system for the T7 RNA polymerase, e.g., a vaccinia 
virus MVA strain recombinant which expresses the T7 RNA 
polymerase (Wyatt et al., ViroloQV . 210: 202-205 (1995), 
incorporated herein by reference) . The viral proteins, and/or 
T7 RNA polymerase, can also be provided from transformed 
mammalian cells, or by transfection of preformed mRNA or 
protein. 
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Alternatively, synthesis of antigenome or genome 
together with the above-mentioned viral proteins can be done 
in vitro (cell-free) in a combined transcription- translation 
reaction, followed by transfection into cells. Or, antigenome 
or genome RNA can be synthesized in vitro and transfected into 
cells expressing RSV proteins. 

Having the infectious clone of the invention 
permits the alteration of the RSV genome (or antigenome) by 
introducing defined mutations. By "infectious clone" is meant 
cDNA or its product, synthetic or otherwise, which can be 
transcribed into genomic or antigenomic RNA capable of serving 
as template to produce the genome of infectious virus or 
subviral particle. Defined mutations can be introduced by 
conventional techniques (e.g., site-directed mutagenesis) into 
a cDNA copy of the genome or antigenome. The use of 
antigenome cDNA subfragments to assemble a complete antigenome 
cDNA as described herein has the advantage that each region 
can be manipulated separately (smaller cDNAs are easier to 
manipulate than large ones) and then readily assembled into a 
complete cDNA. Thus, the complete antigenome or genome cDNA, 
or any subfragment thereof, can be used as template for 
oligonucleotide-directed mutagenesis. This can be through the 
intermediate of a single-stranded phagemid form, such as using 
the Muta-gen kit of Bio-Rad, or a method using the double- 
stranded plasmid directly as template such as the Chameleon 
mutagenesis kit of Stratagene, or by the polymerase chain 
reaction employing either an oligonucleotide primer or 
template which contains the mutation (s) of interest. A 
mutated subfragment can then be assembled into the complete 
antigenome or genome cDNA. A variety of other mutagenesis 
techniques are known and available for use in producing the 
mutations of interest in the RSV antigenome or genome cDNA. 
Mutations can vary from single nucleotide changes to 
replacement of large cDNA pieces containing one or more genes 
or genome regions . 

Thus, in one illustrative embodiment mutations are 
introduced by using the Muta-gene phagemid in vitro 
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mutagenesis kit available from Bio-Rad Laboratories, Richmond, 
CA. In brief, cDNA encoding an RSV genome or antigenome is 
cloned into the plasmid pTZlSU, and used to transform DHSof F' 
cells (Life Technologies Inc., Gaithersburg, MD) . Phagemid 
5 preparations are prepared as recommended by the manufacturer. 
Oligonucleotides are designed for mutagenesis by introduction 
of an altered nucleotide at the desired position of the genome 
or antigenome. The plasmid containing the genetically altered 
genome or antigenome is then amplified. 

10 The ability to introduce defined mutations into 

infectious RSV has many applications, including the analyses 
of RSV molecular biology and pathogenesis. For example, the 
functions of the RSV proteins, including the NSl, NS2, SH, 
M2(ORFl) and M2 (ORF2) proteins, can be investigated by 

15 introducing mutations which ablate or reduce their level of 
expression, or which yield mutant protein. 

As another example, the sequence at the cleavage 
site of the ' F protein, or the putative attachment domain of 
the G protein, can be modified to evaluate effects on growth 

20 in tissue culture and infection and pathogenesis in 
experimental animals . 

The roles of various genome RNA structural 
features, such as promoters, intergenic regions, gene overlap, 
and transcription signals, can be evaluated using the methods 

25 and compositions of the present invention. Evaluation of 

trans-acting proteins and cis -acting RNA sequences using the 
complete antigenome cDNA can be conducted in parallel using 
RSV minigenomes (e.g., Grosfeld et al., J . Virol . 69 : 5677- 
5686 (1995) , incorporated herein by reference) , whose 

30 helper-dependent status is useful in the characterization of 
those mutants which are too inhibitory to be recovered in 
replication- independent infectious virus . 

A number of attenuated RSV strains as candidate 
vaccines for intranasal administration have been developed 

3 5 using multiple rounds of chemical mutagenesis to introduce 
multiple mutations into a virus which had already been 
attenuated during cold-passage (e.g., Connors et al . , Virology 
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208: 478-484 (1995); Crowe et al . , Vaccine 12: 691-699 (1994); 
and Crowe et al . , Vaccine 12: 783-790 (1994), incorporated 
herein by reference) . Evaluation in rodents, chimpanzees, 
adults and infants indicate that certain of these candidate 
5 vaccine strains are relatively stable genetically, are highly 
immunogenic, and may be satisfactorily attenuated. Nucleotide 
sequence analysis of some of these attenuated viruses 
indicates that each level of increased attenuation is 
associated with two or more new nucleotide and amino acid 

10 substitutions (Connors et al., suora ) . The present invention 
provides the ability to distinguish between silent incidental 
mutations versus those responsible for phenotype differences 
by introducing the mutations, separately and in various 
combinations, into the genome or antigenome of infectious 

15 wild-type RSV. This process identifies mutations responsible 
for phenotypes such as attenuation, temperature sensitivity, 
cold-adaptation, small plaque size, host range restriction, 
etc. Mutations from this menu can then be introduced in 
various combinations to calibrate a vaccine virus to an 

20 appropriate level of attenuation, etc., as desired. Moreover, 
the present invention provides the ability to combine 
mutations from different strains of virus into one strain. 

The present invention also provides new methods of 
attenuation. For example, individual internal genes of human 

25 RSV can be replaced with their bovine, murine or other RSV 

counterpart. This may include part or all of one or more of 
the NSl, NS2, N, P, M, SH, M2(0RF1), M2(ORF2) and L genes, or 
non- immunogenic parts of the G and F genes. Reciprocally, 
means are provided to generate a live attenuated bovine RSV by 

3 0 inserting human attenuating genes into a bovine RSV genome or 
antigenome background. Human RSV bearing bovine RSV 
glycoproteins provides a host range restriction favorable for 
human vaccine preparations. Bovine RSV sequences which can be 
used in the present invention are described in, e.g., Pastey 

35 et al., . Gen. Viol. 76:193-197 (1993); Pastey et al . , Virus 
Res . 29:195-202 (1993); Zamora et al . , J. Gen. Virol. 73:737- 
741 (1992); Mallipeddi et al . , .t a^n. Virol. 74:2001-2004 
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(1993); Mallipeddi et al . , J. Gen. Virol. 73:2441-2444 (1992); 
and Zamora et al . , Virus Res . 24:115-121 (1992), each of which 
is incorporated herein by reference. 

The invention also provides the ability to analyze 
5 other types of attenuating mutation and to incorporate them 

into infectious RSV for vaccine or other uses. For example, a 
tissue culture-adapted nonpathogenic strain of pneumonia virus 
of mice (the murine counterpart of RSV) lacks a cytoplasmic 
tail of the G protein (Randhawa et al., Virology 207: 240-245 

10 (1995)). By analogy, the cytoplasmic and transmembrane 

domains of each of the RSV glycoproteins, F, G and SH, can be 
deleted or modified to achieve attenuation. 

Other mutations for use in infectious RSV of the 
present invention include mutations in cis-acting signals 

15 identified during mutational analysis of RSV minigenomes. 

For example, insertional and deletional analysis of the leader 
and trailer and flanking sequences identified viral promoters 
and transcription signals and provided a series of mutations 
associated with varying degrees of reduction of RNA 

20 replication or transcription. Saturation mutagenesis (whereby 
each position in turn is modified to each of the nucleotide 
alternatives) of these cis-acting signals also has identified 
many mutations which reduced (or in one case increased) RNA 
replication or transcription. Any of these mutations can be 

25 inserted into the complete antigenome or genome as described 
herein. Other mutations involve replacement of the 3' end of 
genome with its counterpart from antigenome, which is 
associated with changes in RNA replication and transcription. 
In addition, the intergenic regions (Collins et al . , Proc . 

30 Natl. Acad. Sci . USA 83:4594-4598 (1986), incorporated herein 
by reference) can be shortened or lengthened or changed in 
sequence content, and the naturally-occurring gene overlap 
(Collins et al., Proc. Natl. Acad. Sci. USA 84:5134-5138 
(1987) , incorporated herein by reference) can be removed or 

3 5 changed to a different intergenic region by the methods 
described herein. 
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In another embodiment, RSV useful in a vaccine 
formulation can be conveniently modified to accommodate 
antigenic drift in circulating virus. Typically the 
modification will be in the G and/or F proteins. The entire G 
5 or F gene, or the segment (s) encoding particular immunogenic 
regions thereof, is incorporated into the RSV genome or 
antigenome cDNA by replacement of the corresponding region in 
the infectious clone or by adding one or more copies of the 
gene such that several antigenic forms are represented. 
10 Progeny virus produced from the modified RSV cDNA are then 
used in vaccination protocols against the emerging strains. 
Further, inclusion of the G protein gene of RSV subgroup B 
would broaden the response to cover a wider spectrum of the 
relatively diverse subgroup A and B strains present in the 
15 human population. 

An infectious RSV clone of the invention can also 
be engineered to enhance its immunogenic ity and induce a level 
of protection greater than that provided by natural infection, 
or vice versa, to identify and ablate epitopes associated with 
20 undesirable immunopathologic reactions. Enhanced 

immunogenic ity of the vaccines produced by the present 
invention addresses one of the greatest obstacles to 
controlling RSV, namely the incomplete nature of immunity 
induced by natural infection. An additional gene may be 
25 inserted into or proximate to the RSV genome or antigenome 
which is under the control of an independent set of 
transcription signals. Genes of interest include those 
encoding cytokines (e.g., IL-2 through IL-15, especially IL-3, 
IL-6 and IL-7, etc.), gamma- interferon , and proteins rich in T 
30 helper cell epitopes. The additional protein can be expressed 
either as a separate protein or as a chimera engineered from a 
second copy of one of the RSV proteins, such as SH . This 
provides the ability to modify and improve the immune response 
against RSV both quantitatively and qualitatively. 
3 5 For vaccine use, virus produced according to the 

present invention can be used directly in vaccine 
formulations, or lyophilized, as desired, using lyophilizat ion 
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protocols well known to the artisan. Lyophilized virus will 
typically be maintained at about 4®C. When ready for use the 
lyophilized virus is reconstituted in a stabilizing solution, 
e.g., saline or comprising SPG, Mg*" and HEPES, with or without 
5 adjuvant, as further described below. 

Thus RSV vaccines of the invention contain as an 
active ingredient an immunogenetically effective amount of RSV 
produced as described herein. The modified virus may be 
introduced into a host with a physiologically acceptable 

10 carrier and/or adjuvant. Useful carriers are well known in 
the art, and include, e.g., water, buffered water, 0,4% 
saline, 0.3% glycine, hyaluronic acid and the like. The 
resulting aqueous solutions may be packaged for use as is, or 
lyophilized, the lyophilized preparation being combined with a 

15 sterile solution prior to administration, as mentioned above. 
The compositions may contain pharmaceutically acceptable 
auxiliary substances as required to approximate physiological 
conditions, such as pH adjusting and buffering agents, 
tonicity adjusting agents, wetting agents and the like, for 

2 0 example, sodium acetate, sodium lactate, sodium chloride, 

potassium chloride, calcium chloride, sorbitan monolaurate, 
triethanolamine oleate, and the like. Acceptable adjuvants 
include incomplete Freund' s adjuvant, aluminum phosphate, 
aluminum hydroxide, or alum, which are materials well known in 
25 the art. 

Upon immunization with a RSV composition as 
described herein, via aerosol, droplet, oral, topical or other 
route, the immune system of the host responds to the vaccine 
by producing antibodies specific for RSV virus proteins, e.g., 
30 F and G glycoproteins. As a result of the vaccination the - 
host becomes at least partially or completely immune to RSV 
infection, or resistant to developing moderate or severe RSV 
infection, particularly of the lower respiratory tract. 

The host to which the vaccine are administered can 

3 5 be any mammal which is susceptible to infection by RSV or a 

closely related virus and which host is capable of generating 
a protective immune response to the antigens of the 
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vaccinizing strain. Thus, suitable hosts include humans, non- 
human primates, bovine, equine, swine, ovine, caprine, 
lagamorph, rodents, etc. Accordingly, the invention provides 
methods for creating vaccines for a variety of human and 
5 veterinary uses. 

The vaccine compositions containing the RSV of the 
invention are administered to a host susceptible to or 
otherwise at risk of RSV infection to enhance the host's own 
immune response capabilities. Such an amount is defined to be 

10 a "immunogenically effective dose." In this use, the precise 
amounts again depend on the host's state of health and weight, 
the mode of administration, the nature of the formulation, 
etc., but generally range from about 10^ to about 10^ plaque 
forming units (PFU) or more of virus per host, more commonly 

15 from about 10^ to 10^ PFU virus per host. In any event, the 

vaccine formulations should provide a quantity of modified RSV 
of the invention sufficient to effectively protect the host 
patient against serious or life- threatening RSV infection. 

The RSV produced in accordance with the present 

20 invention can be combined with viruses of the other subgroup 
or strains to achieve protection against multiple RSV 
subgroups or strains, or protective epitopes of these strains 
can be engineered into one virus as described herein. 
Typically the different viruses will be in admixture and 

25 administered simultaneously, but may also be administered 

separately. For example, as the F glycoproteins of the two 
RSV subgroups differ by only about 11% in amino acid sequence, 
this similarity is the basis for a cross-protective immune 
response as observed in animals immunized with RSV or F 

30 antigen and challenged with a heterologous strain. Thus, 
immunization with one strain may protect against different 
strains of the same or different subgroup. 

In some instances it may be desirable to combine 
the RSV vaccines of the invention with vaccines which induce 

3 5 protective responses to other agents, particularly other 
childhood viruses. For example, the RSV vaccine of the 
present invention can be administered simultaneously with 
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parainfluenza virus vaccine, such as described in Clements et 
al., J. Clin. Microbiol- 29:1175-1182 (1991), which is 
incorporated herein by reference. In another aspect of the 
invention the RSV can be employed as a vector for protective 
5 antigens of other respiratory tract pathogens, such as 

parainfluenza, by incorporating the sequences encoding those 
protective antigens into the RSV genome or antigenome which is 
used to produce infectious RSV as described herein. 

Single or multiple administrations of the vaccine 

10 compositions of the invention can be carried out. In neonates 
and infants, multiple administration may be recjuired to elicit 
sufficient levels of immunity. Administration should begin 
within the first month of life, and at intervals throughout 
childhood, such as at two months, six months, one year and two 

15 years, as necessary to maintain sufficient levels of 
protection against native (wild- type) RSV infection. 
Similarly, adults who are particularly susceptible to repeated 
or serious RSV infection, such as, for example, health care 
workers, day care workers, family members of young children, 

20 the elderly, individuals with compromised cardiopulmonary 
function, may require multiple immunizations to establish 
and/or maintain protective immune responses. Levels of 
induced immunity can be monitored by measuring amounts of 
neutralizing secretory and serum antibodies, and dosages 

25 adjusted or vaccinations repeated as necessary to maintain 
desired levels of protection. Further, different vaccine 
viruses may be advantageous for different recipient groups. 
For example, an engineered RSV strain expressing an additional 
protein rich in T cell epitopes may be particularly 

30 advantageous for adults rather than for infants. 

In yet another aspect of the invention the RSV is 
employed as a vector for transient gene therapy of the 
respiratory tract. According to this embodiment the 
recombinant RSV genome or antigenome incorporates a sequence 

35 which is capable of encoding a gene product of interest. The 
gene product of interest is under control of the same or a 
different promoter from that which controls RSV expression. 
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The infectious RSV produced by coexpressing the recombinant 
RSV genome or antigenome with the N, P, L and M2(0RF1) 
proteins and containing a sequence encoding the gene product 
of interest is administered to a patient. Administration is 
typically by aerosol, nebulizer, or other topical application 
to the respiratory tract of the patient being treated. 
Recombinant RSV is administered in an amount sufficient to 
result in the expression of therapeutic or prophylactic levels 
of the desired gene product. Examples of representative gene 
products which are administered in this method include those 
which encode, for example, those particularly suitable for 
transient expression, e.g., interleukin-2 , interleukin-4 , 
gamma- interferon, GM-CSF, G-CSF, erythropoietin, and other 
cytokines, glucocerebrosidase , phenylalanine hydroxylase, 
cystic fibrosis transmembrane conductance regulator (CFTR) , 
hypoxanthine -guanine phosphoribosyl transferase, cytotoxins, 
tumor suppressor genes, antisense RNAs, and vaccine antigens. 

The following examples are provided by way of 
illustration, not limitation. 

EXAMPLE I 

Construction of cDNA Encoding RSV Antigenome 

A cDNA clone encoding the antigenome of RSV strain 
A2 was constructed, as shown in Fig. lA. The cDNA was 
synthesized in segments by reverse transcription (RT) and 
polymerase chain reaction (PGR) using synthetic 
oligonucleotides as primers and intracellular RSV mRNA or 
genome RNA isolated from purified virions as template. The 
final cDNA was flanked on the leader end by the promoter for 
T7 RNA polymerase, which included three transcribed G residues 
for optimal activity; transcription would result in the 
donation of these three nonviral G' s to the 5' end of the 
antigenome. To generate a nearly-correct 3' end, the cDNA 
trailer end was constructed to be adjacent to a 
previously-described hammerhead ribozyme, which upon cleavage 
would donate a single 3 ' -phosphorylated U residue to the 3' 
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end of the encoded RNA (Grosfeld et al . , J. Virol. 69:5677- 
5686, incorporated herein by reference). The ribozyme 
secjuence was followed by a tandem pair of terminators of T7 
RNA polymerase. (The addition of three 5' G residues and one 
2' U residue to a cDNA-encoded RSV minigenome containing the 
chloramphenicol acetyl transferase (CAT) reporter gene had no 
effect on the expression of CAT when complemented by RSV.) 

Fig. lA shows the structures of the cDNA and the 
encoded antigenome RNA (not to scale) . The diagram of the 
antigenome (at top) includes the following features: the 
5 '-terminal nonviral G triplet contributed by the T7 promoter, 
the four sequence markers at positions 1099 (which adds one nt 
to the length) , 1139, 5611, and 7559, the ribozyme and tandem 
T7 terminators, and the single nonviral 3 ' -phosphorylated U 
residue contributed to the 3' end by ribozyme cleavage (the 
site of cleavage is indicated with an arrow) (13). 

Cloned cDNA segments (Fig. lA, middle) representing 
in aggregate the complete antigenome were constructed by 
RT-PCR of RSV mRNA or genome RNA. cDNAs containing the 
lefthand end of the antigenome, spanning from the T7 promoter 
and leader region complement to the SH gene, were assembled in 
a version of pBR322 (Fig. lA, bottom) in which the 
naturally-occurring BamHI site had been ablated by mutagenesis 
and the Pstl-EcoRI fragment replaced with a synthetic 
polylinker containing unique restriction sites (including 
BstBl, BstXI, Pad, BamHI, Mlul) designed to facilitate 
assembly. The box in Fig. lA shows the removal of the BamHI 
site. The naturally occurring BamHI-Sall fragment (the BamHI 
site is shown in the top line in positive sense, underlined) 
was replaced with a PCR-generated Bglll-Sall fragment (the 
Bglll site is shown in the bottom line, underlined; its 4-nt 
sticky end [italics] is compatible with that of BamHI) . This 
resulted in a single nt change (middle line, underlined) which 
was silent at the amino acid level. These modifications to 
the vector facilitated construction of the cDNA by rendering 
unique a BamHI site in the antigenome cDNA. 
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The G, F and M2 genes were assembled in a separate 
plasmid, as were the L, trailer and flanking ribozyme and 
tandem T7 transcription terminators. The G-to-M2 piece was 
then inserted into the PacI-BamHI window of the leader- to-SH 
plasmid. This in turn was the recipient for the 
L-trailer-ribozyme-terminator piece inserted into the BamHI to 
Mlul, yielding the complete antigenome. 

Four restriction site markers (Fig. IB) were 
introduced into the antigenome cDNA by incorporating the 
changes into oligonucleotide primers used in RT-PCR. This was 
done to facilitate assembly, provide a means to identify 
recombinant virus, and illustrate the ability to introduce 
changes into infectious RSV. Three sites were in intergenic 
regions and the fourth in a nontranslated gene region, and 
they involved a total of five nt substitutions and a single nt 
insertion. This increased the length of the encoded 
antigenome by one nt from that of wild type to a total of 
15,223 nt (SEQ ID NO : 1 , which depicts the 5' to 3 ' positive- 
sense sequence whereas the genome itself is negative -sense) . 

The sequence markers were inserted into the 
cDNA-encoded antigenome RNA as shown in Fig. IB. Sequences 
are positive sense and numbered relative to the first nt of 
the leader region complement as 1; identities between strains 
A2 and 18537 (Johnson and Collins, J. Gen Virol. 69:2901-2906 
(1988), incorporated herein by reference), representing 
subgroups A and B, respectively, are indicated with dots; 
sequences representing restriction sites in the cDNA are 
underlined; gene-start (GS) and gene-end (GE) transcription 
signals are boxed; the initiation codon of the N translational 
open reading frame at position 1141 is italicized, and the 
sequence markers are shown underneath each sequence. In the 
top sequence, a single C residue was inserted at position 1099 
to create an Aflll site in the NSII-N intergenic region, and 
the AG at positions 1139 and 1140 immediately upstream of the 
N translational open reading frame were replaced with CC to 
create a new Ncol site. In the middle sequence, substitution 
of G and U at positions 5612 and 5616, respectively, created a 



wo 97/1 2032 PCT/US96/1 5524 

26 

new StuI site in the G-F intergenic region. And, in the 
bottom sequence of Fig. IB, a C replacement at position 7561 
created a new SphI site in the F-M2 intergenic region. 

All cDNAs were sequenced in their entirety, in most 
instances from several independent cDNAs, prior to assembly. 
The plasmids encoding individual RSV proteins are described in 
Grosfeld et al., J. Virol. 69:5677-5686 (1995) and Collins et 
al., Proc. Natl. Acad. Sci. USA (1995), each of which is 
incorporated herein by reference. 



EXAMPLE II 

Transfection and Recovery of Recombinant RSV 

The strategy for producing infectious RSV from 
cDNA- expressed antigenome involved its coexpression with those 
RSV proteins which are sufficient to (i) produce an antigenome 
nucleocapsid capable of RNA replication, and (ii) render the 
progeny genome nucleocapsid competent for both RNA replication 
and transcription. Transcription by the genome nucleocapsid 
provides all of the other RSV proteins and initiates a 
productive infection . 

Plasmid-borne cDNA encoding the antigenome was 
transfected, together with plasmids encoding proteins N, P, L 
and M2(ORFl), into HEp-2 cells which had been infected with a 
recently-described vaccinia virus MVA strain recombinant which 
expresses the T7 RNA polymerase (Wyatt et al . , Virol . 210:202- 
205 (1995) , incorporated herein by reference) . The MVA strain 
is a host range mutant which grows permissively in avian cells 
whereas in mammalian cells there is a block at a late stage in 
virion maturation that greatly reduces the production of 
infectious virus. In HEp-2 cells, the MVA recombinant was 
similar to the more commonly-used WR-based recombinant (Fuerst 
et al., Proc. Natl. Acad. Sci. USA 83: 8122-8126 (1986)) with 
regard to the level of expression of T7 polymerase and 
cytopathogenicity , but the level of progeny produced was 
sufficiently low that supernatants could be passaged to fresh 
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cells with minimal cytopathogenicity . This should facilitate 
the recovery of any recombinant RSV which might be produced in 
transfected, vaccinia virus-infected cells. 

Transfection and recovery of recombinant RSV was 
5 performed as follows. Monolayer cultures of HEp-2 cells 
received, per single well of a six-well dish, one ml of 
inf ection-transfection medium prepared by mixing five plasmids 
in a final volume of 0.1 ml Opti-MEM (Life Technologies) 
medium/ namely 0.4 /ig each of antigenome, N and P plasmids, 
10 and 0.1 /xg each of L and M2 (ORFl) plasmids. This was combined 
with 0.1 ml of Opti-MEM containing 12 ^1 LipofectACE (Life 
Technologies) . After 15 min incubation at room temperature, 
this was combined with 0.8 ml of OptiMEM containing 2% 
heat -inactivated fetal calf serum and 1.5 X 10^ pfu of strain 
15 MVA vaccinia virus recombinant encoding T7 RNA polymerase 
(Wyatt et al., supra ) . This was added to the cells and 
replaced one day later by Opti-MEM containing 2% serum. 
Cultures were incubated at 32^0 and harvested on day three. 
Incubation at 32<*C was used because it was found that the MVA 
20 virus is slightly temperature sensitive and is much more 
efficient at this lower temperature. 

Three days post -transfection clarified culture 
supernatants were passaged onto fresh HEp-2 cells and overlaid 
with methyl cellulose (for subsequent antibody staining) or 
25 agarose (for plaque isolation). After incubation for five 

days under methyl cellulose, the cells were fixed and stained 
by an indirect horseradish peroxidase method using a mixture 
of three murine monoclonal antibodies to the RSV F protein 
followed by an anti-mouse antibody linked to horseradish 
3 0 peroxidase, following the general procedure of Murphy et al . , 
Vaccine 8: 497-502 (1990). 

Numerous RSV- like plaques were detected against a 
background of cytopathogenicity that presumably was due to a 
low level of MVA-T7 recombinant virus. The plaques contained 
3 5 an abundant amount of the RSV F protein, as indicated by 
brown-black coloration, and displayed cytopathic effects 
characteristic of RSV, notably syncytium formation. 



wo 97/12032 PCT/US96/15524 

28 

The RSV-like plaques were picked from plates which 
had been prepared in parallel but incubated under agarose and 
stained with neutral red. They were propagated and compared 
to a laboratory strain of RSV strain A2 by plaque assay and 
5 antibody staining. The plaques derived from the transfected 
cultures closely resembled those of the laboratory strain. 
One difference was that the plaques derived from the 
transfected cultures appeared to be slightly smaller than 
those from the laboratory strain, with centers which were less 

10 well cleared. The recombinant virus may differ phenotypically 
from this particular wild type isolate, possibly being 
slightly more restricted in cell-to-cell spread and exhibiting 
a reduced rate of cell killing. With regard to the 
propagation of released virus, the yields of the recombinant 

15 versus laboratory virus in HEp-2 cells were essentially 
identical at 32° or 37®C. In preliminary studies, the 
recombinant and laboratory viruses were indistinguishable with 
regard to the accumulation of intracellular RSV mRNAs and 
proteins . 

20 Plaque-purified, thrice-passaged recombinant RSV 

was analyzed in parallel with laboratory virus by RT-PCR using 
three primer pairs flanking the four inserted markers. Three 
independent plaque -purified recombinant RSV isolates were 
propagated in parallel with an uninfected control culture. 

25 Clarified medium supernatants were treated with polyethylene 
glycol and high salt (Zoller and Smith, DNA 3:479-488 (1984)) 
to precipitate virus and RNA was extracted from the pellets 
with TrizolTM (Life Technologies). These RNAs , in parallel 
with additional controls of no added RNA or 0.1 /ig of RNA from 

3 0 a laboratory isolate of strain A2 , were treated with DNAse, 
repurified, annealed each with 50 ng of random hexamers and 
incubated under standard RT conditions (40 ^1 reactions) with 
or without reverse transcriptase (Connors et al., Virol . 208: 
478-484 (1995)). Aliquots of each reaction were subjected to 

35 PCR (35 cycles of 94°C for 45s, 37<'C for 30s, 72*^0 for 1 min) 
using three different pairs of synthetic deoxyoligonucleotide 
primers. Primer pair (A): positive-sense, positions 925-942 
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and negative -sense, positions 1421-1440, yielding a predicted 
product of 516 bp (517 bp in the case of the recombinant 
viruses) that included the Aflll and Ncol sites inserted at, 
respectively, the junction of the NS2 and N genes and in the N 
5 gene. Primer pair (B) : positive-sense, positions 5412-5429 

and negative -sense, 5930-5949, yielding a predicted product of 
53 8 bp spanning the StuI site inserted at the junction between 
the G and F genes. Primer pair (C) : positive-sense, 7280-7297 
and negative-sense, 7690-7707, yielding a 428 bp fragment 

10 spanning the SphI site inserted at the junction between the F 
and M2 genes. PGR products were analyzed by electrophoresis 
on neutral gels containing 1% agarose and 2% low-melting 
agarose in parallel with Haelll -digested X174 DNA molecular 
length markers and visualized by staining with ethidium 

15 bromide. PGR products of the expected sizes were produced. 
The production of each was dependent on the RT step, 
indicating that each was derived from RNA rather than 
contaminating cDNA. 

PGR products were analyzed by digestion with 

20 restriction enzymes. Digestion of products of primer pair A 
with Aflll or Ncol yielded fragments corresponding to the 
predicted 177 and 340 bp (Aflll) or 217 and 300 bp (Ncol) . 
Digestion of products of primer pair B with StuI yielded 
fragments comparable to the predicted 201 and 33 7 bp. 

25 Digestion of products from reactions with primer pair G with 
SphI yielded products corresponding to the predicted 14 7 and 
281 bp. The digests were analyzed by gel electrophoresis as 
above. The presence of residual undigested PGR product with 
Aflll was due to incomplete digestion, as was confirmed by 

30 redigestion. Thus, the restriction enzyme digestion showed 

that the PGR products representing recombinant virus contained 
the expected restriction site markers while those representing 
the laboratory strain did not. Nucleotide sequence analysis 
of cloned PGR product confirmed the sequences spanning the 

35 restriction site markers. 

As shown in Table 1, the efficiency of RSV 
production when complemented by N, P, L and M2{ORFl) was 
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relatively high, ranging in three experiments from an average 
of 9.9 to 94.8 plaques per 0.4 fig of input antigenome cDNA and 
1.5 X 10^ cells. Since these plaques were derived from 
passage, the number of infected cells present in each well of 
the original transfection was not known. Nearly every 
transfected well (54 of 56 in Table 1) produced virus. Since 
the yield of released RSV per infected cell typically is very 
low («10 pfu) even under ideal conditions, and since many 
wells yielded many times this amount (up to 169 plaques in 
Table 1) , it is likely that several RSV producing cells were 
present in many of the wells of transfected cells. 

RSV was not recovered if any of the plasmids were 
omitted (e.g., as shown in Table 1). The requirement for 
M2(0RF1) also could be satisfied with the complete gene, 
M2(ORFl+2), provided the level of its input cDNA was low 
(0.016 ^xg per 1.5 X 10* cells [Table 1]). At higher levels, 
the production of virus was greatly reduced, suggesting that 
an inhibition of minigenome RNA synthesis associated with 
M2(ORF2) also operates on the complete genome during 
productive infection . 

These results showed that the production of 
infectious RSV was highly dependent on expression of the 
M2(ORFl) protein in addition to N, P and L. Furthermore, it 
showed that the optimal method of expression of M2(ORFl) was 
from an engineered cDNA in which ORF2 had been deleted, 
although the complete cDNA containing both ORFs also supported 
the production of RSV. 

Thus, as part of the present invention, 
transcription by RSV differed from previously-described 
nonsegmented negative strand RNA viruses in requiring a fourth 
protein designated here as M2(0RF1), and previously called 22K 
or M2 (Collins et al . , J. Virol. 54:65-71 (1985)). The 
M2 (ORFl) protein was found to be an RNA polymerase elongation 
factor essential for processive, sequential transcription. 
This requirement provides the capability, as part of this 
invention, for introducing specific, predetermined changes 
into infectious RSV. 
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Table 1. Production of infectious RSV was dependent on 
expression of M2 ORF 1 . 



Complementing plasmids 
(/xg cDNA per 1.5X10^ 
cells and antigenome 
cDNA) 0.4 UQ 

N{0.4) 
P(0.4) 
L(O.l) 

N(0 .4) 
P(0.4) 
L(O.l) 

M2 [ORFl+2] (0.016) 



N(0.4) 
P(0 .4) 
L(O.l) 
M2 [ORFl] 



(0.1) 



Production of infectious RSV 
#plaaues X # wells" 



expt , 1 



expt 



expt . 3 







0 


X 24 






0 


X 12 


0 


X 12 






0 


X 19 


s 




0 


X 4 


9 


X 1 






1 


X 2 






3 


X 1 


10 


X 1 






2 


X 2 






5 


X 1 


14 


X 2 






3 


X 1 






6 


X 1 


22 


X 1 














9 


X 1 


28 


X 1 




av 


0 .38 




10 


X 1 


32 


X 1 














13 


X 1 


49 


X 1 














34 


X 1 


70 


X 2 














51 


X 1 


166 


X 1 


















169 


X 1 














av . 


10.9 






















av . 


48.6 


0 


X 


1 


11 


X 


1 


0 


X 1 


55 


X 1 


1 


X 


1 


12 


X 


1 


2 


X 1 


59 


X 1 


2 


X 


2 


13 


X 


1 


4 


X 1 


65 


X 1 


3 


X 


2 


21 


X 


1 


5 


X 1 


71 


X 1 


4 


X 


1 


24 


X 


1 


8 


X 2 


72 


X 1 


5 


X 


2 


26 


X 


1 


10 


X 3 


87 


X 1 


6 


X 


4 


30 


X 


2 


19 


X 1 


97 


X 1 


7 


X 


2 


33 


X 


2 


20 


X 1 


100 


X 1 


9 


X 


1 


42 


X 


1 


23 


X 1 


109 


X 1 


10 


X 


2 


73 


X 


1 






128 


X 1 














av 


. 9.9 


147 


X 1 








av . 


13 


. 7 






148 


X 1 


















av . 


94 . 8 



• Supernatants from transfected cultures (10^ cells per well) 
were passaged onto fresh HEp-2 cells, overlaid with methyl 
cellulose, and stained with F-specific monoclonal antibodies. 

* Read as follows: 19 wells had 0 plaques, 2 wells had 1 
plaque each, 2 wells had 2 plaques each, and 1 well had 3 
plaques . 



wo 97/12032 PCT/US96/1 5524 

32 

EXAMPLE III 

Constructing Infectious RSV With Predetermined Mutations 
To Confer a Desired Phenotvpe 

5 This Example illustrates the introduction of 

specific predetermined mutations into infectious recombinant 
RSV using the methods described hereinabove. For ease of 
manipulation, the antigenome cDNA was cloned as two separate 
pieces in separate plasmids : one piece (the left end) 

10 containing the T7 promoter together with nucleotide 1 through 
to the BamHI site at nucleotide 8501 (cDNA D50) , and the other 
(the right end) containing the BamHI site through to 
nucleotide 15223, together with the ribozyme and T7 
transcription terminators (cDNA D39) . D3 9 was further 

15.. separated into two pieces and each placed in a separate 

phagemid plasmid: one piece {left hand half, cDNA LI) from the 
BamHI site to the Pmll site at nucleotide 12255, and the other 
(right hand half, cDNA L2) from the Pmll site to the end of 
the T7 terminator. The sequence positions assigned to 

20 restriction site locations are intended as a descriptive guide 
and do not alone precisely define all of the nucleotides 
involved. 

Following a general procedure of Kunkel et al . Meth . 
Enzvmol . 54:367-382 (1987) (incorporated herein by reference), 

25 the plasmids were propagated in a dut unq strain of E. coli , 
strain CJ236, and single stranded DNA was prepared by 
infection with a helper phage, M13K07. Phosphorylated 
synthetic oligonucleotides each containing one or more 
nucleotide changes of interest were prepared, annealed to the 

30 single stranded template singly or more than one at a time, 
and used to direct DNA synthesis by T4 DNA polymerase. The 
products were ligated and transformed into a non-dut unq 
strain of E, coli > DHSof or DHIOB. Colonies containing mutant 
plasmids were identified by restriction enzyme digestion or 

35 by sequence analysis. Other methods of mutagenesis can 
readily be used. 
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LI and L2 described above were modified to contain 
several combinations of recognition sites for several 
different restriction enzymes which do not appear or are 
infrequent in the antigenome plasmid; these sites were 
introduced using nucleotide substitutions which did not change 
the amino acid sequence of the encoded L protein. In 
addition, LI was modified to contain a mutation believed to 
confer a ts phenotype . Two versions of LI were made. In one 
version, LI was modified in a single cycle of mutagenesis to 
contain new Bsu36I (nucleotide 9399) and SnaBI (11848) sites, 
and a mutation termed 530, yielding cDNA 530Llsites. The 530 
mutation had been identified by sequence analysis of the 
biologically-derived virus cpts53 0-RSV and involves a single 
nucleotide change at position 1006 0 which results in an amino 
acid change (Phe to Leu) at amino acid 521 in the L protein. 
In a second version, LI was modified in a single cycle to 
contain the Bsu36I and SnaBI sites, resulting in cDNA LI 
sites. L2 was modified in one cycle of mutagenesis to contain 
the new sites Pmel (13342), RsrII (14083) and SnaBI (14477). 
In a second cycle, the site BstEII (14318) was added and a 
naturally-occurring recognition site for SnaBI (6956) was 
removed. This yielded L2 sites. 

Three complete antigenome cDNAs were made by 
introducing selected LI and L2 mutant cDNAs or fragments 
thereof into D39 and combining this with D50. Antigenome cDNA 
"D53sites" contains LI sites and L2 sites. cDNA "530D53" 
contains the BamHI-Spel (10149) fragment of 530Llsites (which 
contains the Bsu36I and 530 mutations) . cDNA " 530D53sites" 
contained 530L1 sites and L2 sites (Table II) . Recombinant 
virus was recovered from each of the three complete mutant 
antigenome cDNAs using the methodology of this invention and 
were passaged at least twice and analyzed directly or 
following plaque purification and amplification. The presence 
of mutations was confirmed by RT-PCR of viral RNA followed by 
analysis by restriction enzyme digestion or nucleotide 
sequencing, or both. 
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The engineered viruses were evaluated for their 
ability to form plaques in HEp-2 cells at 32^C, 39°C and 40^C 
in parallel with two nonrecotnbinant biologically-derived 
viruses, HEK, a wild type strain A2 virus, and cpts530 RSV, 
the virus from which the 530 mutation was identified by 
sequence analysis (Table II) . This comparison showed that all 
three engineered viruses formed plaques at 3 2 °C, and showed 
that the titers of the various virus preparations were within 
two log^o units of each other, which is within the range of 
experimental variation typically seen among independent 
preparations of RSV. The recombinant viruses containing the 
530 mutation were greatly impaired in ability to form plaques 
at 39°C or 40*^C, comparable to cpts530 RSV, The presence of 
additional restriction sites in 530D53sites versus 530D53 had 
no discernable effect on the ts phenotype . D53sites virus, 
which contained silent restriction site markers but lacked the 
530 mutation, retained the ability to form plaques at the 
higher temperatures, comparable to wild type. This not only 
provided positive identification that the 530 mutation is 
involved in the ts phenotype of cpts53 0 RSV, but also showed 
that point mutations can be introduced systematically into 
recombinant RSV according to the present invention. In these 
cases, the resulting phenotypes of the engineered viruses were 
fully consistent with the parental strain and provided direct 
confirmation and reconst itution of an attenuation mutation. 
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Table II 

Characterization of the ts phenotype of biologically- 
derived RSV versus RSV recovered from cDNA clones 



virus 


Efficiency of 
(logio) the 


plaque formation 
indicated temperature 


32°C 


390c 40°C 


Bioloaicallv- derived 


viiruses 




HEK* 


8,7 


8,6 8.5 


cpt:s53 0RSV^ 


6.8 


<0.7 <0.7 



15 cDNA-derived viruses 
D53 sites'= 
530D53^ 
530D53 sites*' 

20 * = Wild type RSV A2 , 
^ = ts virus • 

^ = Contains six new restriction sites and lacks one 
naturally-occurring site- 

*^ = Contains one new restriction site and the 53 0 mutation. 
25 ^ = Contains six new restriction sites and the 530 mutation 
and lacks one naturally-occurring site. 



3 0 EXAMPLE IV 

Recovery of Infectious Respiratory Syncyt ial Virus 
Expressing An Additional, Foreign Gene 
The methods described above were used to construct 
recombinant RSV containing an additional gene, encoding 
3 5 chloramphenicol acetyl transferase (CAT) . The CAT coding 

sequence was flanked by RSV-specific gene -start and gene-end 
motifs, the transcription signals for the viral RNA-dependent 
RNA polymerase- The RSV/CAT chimeric transcription cassette 
was inserted into the intergenic region between the G and F 
40 genes of the complete cDNA-encoded positive-sense RSV 

antigenome, and infectious CAT -expressing recombinant RSV was 
recovered. The CAT mRNA was efficiently expressed and the 



6,9 6.9 6.7 

7,9 <0.7 <0.7 

7,6 <0,7 <0,7 
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levels of the G and F mRNAs were comparable to those expressed 
by wild type recombinant RSV. The CAT- containing and wild 
type viruses were similar with regard to the levels of 
synthesis of the major viral proteins, 
5 Plasmid D4 6 was used for construction of cDNA 

encoding RSV antigenomic RNA containing the CAT gene. 
{Plasmids D46 and D50, the latter mentioned in Example III, 
are different preparations of the same antigenome cDNA. ) D46 
which encodes the complete, 15 , 223 -nucleotide RSV antigenome 

10 (one nucleotide longer than that of wild type RSV) and was 
used to produce recombinant infectious RSV described above. 
During its construction, the antigenome cDNA had been modified 
to contain four new restriction sites as markers. One of 
these, a StuI site placed in the intergenic region between the 

15 G and F genes (positions 5611-5616 in the 3' -5' sequence of 

the wild type genome), was chosen as an insertion site for the 
foreign CAT gene. A copy of the CAT ORF flanked on the 
upstream end by the RSV GS signal and on the downstream end by 
the RS GE signal was derived from a previously-described RSV- 

20 CAT minigenome (Collins et al., Proc . Natl. Acad. Sci . USA 
88:9663-9667 (1991) and Kuo et al . , J. Virol. 70: 6892-6901 
(1996) , incorporated by reference herein) . The insertion of 
this RSV/CAT transcription cassette into the StuI site, to 
yield the D46/1024CAT cDNA, increased the length of the 

25 encoded antigenome to a total of 15,984 nucleotides. And, 

whereas wild type RSV encodes ten major subgenomic mRNAs, the 
recombinant virus predicted from the D46/1024CAT antigenome 
would encode the CAT gene as an eleventh mRNA. The strategy 
of construction is shown in Fig. 2. 

3 0 Producing infectious RSV from cDNA- encoded 

antigenomic RNA, as described above, involved coexpression in 
HEP-2 cells of five cDNAs separately encoding the antigenomic 
RNA or the N, P, L or M2 (ORFl) protein, which are necessary 
and sufficient for viral RNA replication and transcription. 

3 5 cDNA expression was driven by T7 RNA polymerase supplied by a 
vaccinia-T7 recombinant virus based on the MVA strain. The 
MVA-T7 recombinant virus produced infectious progeny 
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sufficient to cause extensive cytopathogenicity upon passage, 
and therefore, cytosine arabinoside, an inhibitor of vaccinia 
virus replication, was added 24 h following the transfection 
and maintained during the first six passages. 

Two antigenome cDNAs were tested for the recovery of 
RSV: the D46 cDNA, and the D46/1024CAT cDNA. Each one yielded 
infectious recombinant RSV. Cells infected with the D46/ 
1024CAT recombinant virus expressed abundant levels of CAT 
enzyme. For each virus, transfection supernatants were 
passaged to fresh cells, and a total of eight serial passages 
were performed at intervals of five to six days and a 
multiplicity of infection of less than 0.1 PFU per cell. 

The CAT sequence in the D46/1024CAT genome was 
flanked by RSV GS and GE signals, and thus should be expressed 
as an additional, separate, polyadenylated mRNA. The presence 
of this predicted mRNA was tested by Northern blot 
hybridization of RNA from cells infected with D46/1024CAT 
virus or D46 virus at the eighth passage. Hybridization with 
a negative-sense CAT-specific riboprobe detected a major band 
which was of the appropriate size to be the predicted CAT 
mRNA, which would contain 735 nucleotides not including 
poly (A) . This species was completely retained by oligo(dT) 
latex particles, showing that it was polyadenylated. In some 
cases, a minor larger CAT-specific species was detected which 
was of the appropriate size to be a G-CAT readthrough mRNA. 
The D46/1024CAT virus had been subjected to eight passages at 
low multiplicity of infection prior to the infection used for 
preparing the intracellular RNA. There was no evidence of 
shorter forms of the CAT mRNA, as might have arisen if the CAT 
gene was subject to deletion. 

Replicate blots were hybridized with negative-sense 
riboprobe specific to the CAT, SH, G or F gene, the latter two 
genes flanking the inserted CAT gene. The blots showed that 
the expression of the subgenomic SH, G and F mRNAs was similar 
for the two viruses. Phosphoimagery was used to compare the 
amount of hybridized radioactivity in each of the three RSV 
mRNA bands for D46/1024CAT and D46. The ratio of 
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radioactivity between D46/1024CAT and D46 was determined for 
each ruRNA: SH, 0.77; G, 0.87; and F, 0.78. The deviation from 
unity probably indicates that slightly less RNA was loaded for 
D46/1024CAT versus D46, although it also is possible that the 
5 overall level of mRNA accumulation was slightly less for 

D46/1024CAT RSV. The demonstration that the three ratios were 
similar confirms that the level of expression of each of these 
mRNAs was approximately the same for D46/1024CAT versus D46 . 
This, the insertion of the CAT gene between the G and F genes 
10 did not drastically affect the level of transcription of 
either gene. 

To characterize viral protein synthesis, infected 
HEp-2 cells were labeled with [^^S] methionine , and cell lysates 
were analyzed by PAGE either directly or following 

15 immunoprecipitation under conditions where recovery of labeled 
antigen was essentially complete. Precipitation with a rabbit 
antiserum raised against purified RSV showed that the 
D46/1024CAT and D46 viruses both expressed similar amounts of 
the major viral proteins F^, N, P, M, and M2 . That a similar 

20 level of M2 protein was recovered for each virus was 

noteworthy because its gene is downstream of the inserted CAT 
gene. Accumulation of the F protein, which is encoded by the 
gene located immediately downstream of the insertion, also was 
examined by immunoprecipitation with a mixture of three anti-F 

25 monoclonal antibodies. A similar level of the F^ subunit was 
recovered for each virus. Phosphorimagery analysis of the 
major viral proteins mentioned above was performed for several 
independent experiments and showed that some sample- to-sample 
variability, but overall the two viruses could not be 

30 distinguished on the basis of the level of recovered proteins. 
Precipitation with anti-CAT antibodies recovered a single 
species for the D46/1024CAT but not for the D46 virus. 
Analysis of the total labeled protein showed that the N, P and 
M proteins could be detected without immunoprecipitation 

35 {although detection of the latter was complicated by its 

comigration with a cellular species) and confirmed that the 
two viruses yielded similar patterns. The position 
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corresponding to that of the CAT protein contained more 
radioactivity in the D46/1024CAT pattern compared to that of 
D46, as was confirmed by phosphorimagery of independent 
experiments. This suggested that the CAT protein could be 
5 detected among the total labeled proteins without 

precipitation, although this demonstration was complicated by 
the presence of a comigrating background band in the 
uninfected and D4 6 -infected patterns. 

RT-PCR was used to confirm the presence of the CAT 

10 gene in the predicted location of the genome of recombinant 
RSV. Total intracellular RNA was isolated from the cell 
pellet of passage eight of both D46/1024CAT and D46 RSV, Two 
primers were chosen that flank the site of insertion, the Stul 
restriction endonuclease site at RSV positions 5611-5616: the 

15 upstream positive-sense primer corresponded to positions 5412- 
5429, and the downstream negative-sense one to positions 5730- 
5711. The positive-sense primer was used for the RT step, and 
both primers were included in the PCR. 

RT-PCR of the D46 virus yielded a single product 

20 that corresponded to the predicted fragment of 318 

nucleotides, representing the G/F gene junction without 
additional foreign sequence. Analysis of D46/1024CAT viral 
RNA yielded a single product whose electrophoretic mobility 
corresponded well with the predicted 1079 nucleotide fragment, 

25 representing the G/F gene junction containing the inserted CAT 
transcription cassette. The latter PCR yielded a single major 
band; the absence of detectable smaller products indicated 
that the population of recombinant genomes did not contain a 
large number of molecules with a deletion in this region. 

30 When PCR analysis was performed on D46/1024CAT virus RNA 

without the RT step, no band was seen, confirming that the 
analysis was specific to RNA. Thus, the RT-PCR analysis 
confirmed the presence of an insert of the predicted length in 
the predicted location in the genomic RNA of the D46/1024CAT 

35 recombinant virus. 

Enzyme expression was used to measure the stability 
of the CAT gene. Cell pellets from all of the passages 
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beginning with the third were tested for CAT expression. For 
the virus D46/1024CAT, all these assays displayed conversion 
of [^*C] labeled chloramphenicol into acetylated forms. To 
investigate stability of expression, virus from 20 or 25 
5 individual placjues from passage three or eight, respectively, 
was analyzed for CAT expression. All samples were positive, 
and the level of expression of CAT was similar for each of the 
25 isolates from passage eight, as judged by assay of 
ecjuivalent aliquots of cell lysate. This demonstrated that 

10 the activity of the CAT protein encoded by each isolate 
remained unimpaired by mutation. 

To determine plaque morphology and size, beginning 
with the second passage, one-eighth of the medium supernatant 
(i.e., 0.5 ml) harvested from each passage stage was used to 

15 infect. fresh HEp-2 cells in six-well plates that were 

incubated under methylcellulose overlay for five to seven 
days. The cells were then fixed and stained by incubation 
with monoclonal antibodies against RSV F protein followed by a 
second antibody linked to horseradish peroxidase. Earlier, it 

2 0 had been observed that recombinant RSV produced from cDNA D4 6 
was indistinguishable from a naturally-occurring wild type RSV 
isolate with regard to efficiency of plaque formation over a 
range of temperatures in vitro, and the ability to replicate 
and cause disease when inoculated into the respiratory tract 

25 of previously uninfected chimpanzees. Thus, the D46 

recombinant RSV was considered to be a virulent wild type 
strain. The plaques produced by the D46 and D46/1024CAT 
recombinant viruses were compared by antibody staining. 
Plaque morphology was very similar for the two viruses, 

30 although the average diameter of the CAT-containing 

recombinant plaques was 90 percent of that of the D46 virus, 
based on measurement of thirty randomly- selected plaques for 
each virus. 

The efficiency of replication in tissue culture of 
35 the D46 and D46/1024CAT viruses was compared in a single step 
growth cycle. Triplicate monolayers of cells were infected 
with either virus, and samples were taken at 12 h intervals 
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and quantitated by plaque assay. The results showed that the 
production of D46/1024CAT virus relative to D46 was delayed 
and achieved a maximum titer which was 20 -fold lower. 

These results show that it is possible to construct 
5 recombinant, helper -independent RSV expressing a foreign gene, 
in this instance the CAT gene. The recombinant RSV directed 
expression of the predicted polyadenylated subgenomic mRNA 
that encoded CAT protein, the protein being detected both by 
enzyme assay and by radioimmunoprecipitation . Other examples 

10 have produced RSV recombinants with the luc if erase gene 

inserted at the same CAT site, or with the CAT or lucif erase 
genes inserted between the SH and G genes. These viruses also 
exhibit reduced growth, whereas the numerous wild type 
recombinant viruses recovered exhibit undiminished growth. 

15 This indicates that the reduced growth indeed is associated 
with the inserted gene rather than being due to chance 
mutation elsewhere in the genome. The finding that insertion 
of a foreign gene into recombinant RSV reduced its level of 
replication and was stable during passage in vitro suggests 

20 that this provides yet another means for effecting attenuation 
for vaccine use. And, these results demonstrate that the 
methodology described herein is capable of recovering a virus 
that is restricted in growth. 

These results also illustrate an advantage of the 

25 strategy of gene expression of the nonsegmented negative 

strand viruses, namely that the foreign coding sequences can 
be introduced as a separate transcription cassette that is 
expressed as a separate mRNA. The results also show that RSV 
can tolerate an increase of genome length of 762 nucleotides 

30 in the case of the CAT gene to a total of 15,984 nucleotides 
(1.05 times that of wild type RSV) . The lucif erase gene that 
was successfully recovered is almost three times longer. 

The viral RNA-dependent RNA polymerases are known to 
have an error-prone nature due to the absence of proofreading 

35 and repair mechanisms. In RNA virus genomes, the frequency of 
mutation is estimated to be as high as IQ-'' - 10*^ per site on 
average (Holland et al . , Curr. To p- Microbiol. Immunol. 176:1- 
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20 (1992) and references therein) . In the case of the 
recombinant D46/1024CAT RSV produced here, correct expression 
of the foreign gene would be irrelevant for virus replication 
and would be free to accumulate mutations. The passages 
5 described here involved a multiplicity of infection less than 
0.1 PFU per cell, and the duration of each passage level 
indicated that multiple rounds of infection were involved. 
While yields of infectious virus from RSV- infected tissue 
culture cells typically are low, intracellular macromolecular 

10 synthesis is robust, and the poor yields of infectious virus 
seems to represent an inefficient step in packaging rather 
than low levels of RNA replication. Thus, the maintenance of 
CAT through eight serial passages involved many rounds of RNA 
replication. It was surprising that the nonessential CAT gene 

15 remained intact and capable of encoding fully functional 
protein in each of the 25 isolates tested at the eighth 
passage. Also, RT-PCR analysis of RNA isolated from passage 
eight did not detect deletions within the CAT gene. 

Because most of the antigenic difference between the 

20 two RSV antigenic subgroups resides in the G glycoprotein, 

recombinant RSV can be constructed to express the G protein of 
the heterologous subgroup as an additional gene to yield a 
divalent vaccine. Envelope protein genes of some other 
respiratory viruses, such as human parainfluenza 3 virus, also 

2 5 can be inserted for expression by recombinant RSV. Other uses 

include coexpression of immune modulators such as inter leukin 
6 to enhance the immunogenicity of infectious RSV. Other 
uses, such as employing modified RSV as described herein as a 
vector for gene therapy, are also provided. 

3 0 Although the foregoing invention has been described 

. in some detail by way of illustration and example for purposes 
of clarity of understanding, it will be obvious that certain 
changes and modifications may be practiced within the scope of 
the appended claims. 



•I i 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: The Government of the United States of America, as 
represented by the Secretary of the Department of Health and Human 
Services 

(B) STREET: Box OTT 

(C) CITY: Bethesda 

(D) STATE: Maryland 

(E) COUNTRY: United States of America 

(F) POSTAL CODE (ZIP) ; 20892 

(G) TELEPHONE: (301) 496-7056 

(H) TELEFAX: (301) 402-0220 

(I) TELEX: 



(ii) TITLE OF INVENTION: PRODUCTION OF INFECTIOUS RESPIRATORY 
SYNCYTIAL VIRUS FROM CLONED NUCLEOTIDE SEQUENCES 

(iii) NUMBER OF SEQUENCES: 1 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/007,083 

(B) FILING DATE: 27-SEP-1995 

(vii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Parmelee, Steven W. 

(B) REGISTRATION NUMBER: 31,990 

(C) REFERENCE/DOCKET NUMBER: 15280-250- IPC 

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 206-467-9600 

(B) TELEFAX: 415-576-0300 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ACGSGAAAAA ATGCGTACAA CAAACTTGCA TAAACCAAAA AAATGGGGCA AATAAGAATT 60 
TGATAAGTAC CACTTAAATT TAACTCCCTT GGTTAGAGAT GGGCAGCAAT TCATTGAGTA 12 0 
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TGATAAAAGT TAGATTACAA AATTTGTTTG ACAATGATGA AGTAGCATTG TTAAAAATAA 180 

CATGCTATAC TGATAAATTA ATACATTTAA CTAATGCTTT GGCTAAGGCA GTGATACATA 24 0 

CAATCAAATT GAATGGCATT GTGTTTGTGC ATGTTATTAC AAGTAGTGAT ATTTGCCCTA 3 00 

ATAATAATAT TGTAGTAAAA TCCAATTTCA CAACAATGCC AGTACTACAA AATGGAGGTT 3 60 

ATATATGGGA AATGATGGAA TTAACACATT GCTCTCAACC TAATGGTCTA CTAGATGACA 42 0 

ATTGTGAAAT TAAATTCTCC AAAAAACTAA GTGATTCAAC AATGACCAAT TATATGAATC 48 0 

AATTATCTGA ATTACTTGGA TTTGATCTTA ATCCATAAAT TATAATTAAT ATCAACTAGC 540 

AAATCAATGT CACTAACACC ATTAGTTAAT ATAAAACTTA ACAGAAGACA AAAATGGGGC 600 

AAATAAATCA ATTCAGCCAA CCCAACCATG GACACAACCC ACAATGATAA TACACCACAA 66 0 

AGACTGATGA TCACAGACAT GAGACCGTTG TCACTTGAGA CCATAATAAC ATCACTAACC 72 0 

AGAGACATCA TAACACACAA ATTTATATAC TTGATAAATC ATGAATGCAT AGTGAGAAAA 780 

CTTGATGAAA AGCAGGCCAC ATTTACATTC CTGGTCAACT ATGAAATGAA ACTATTACAC 84 0 

AAAGTAGGAA GCACTAAATA TAAAAAATAT ACTGAATACA ACACAAAATA TGGCACTTTC 9 00 

CCTATGCCAA TATTCATCAA TCATGATGGG TTCTTAGAAT GCATTGGCAT TAAGCCTACA 960 

AAGCATACTC CCATAATATA CAAGTATGAT CTCAATCCAT AAATTTCAAC ACAATATTCA 1020 

CACAATCTAA AACAACAACT CTATGCATAA CTATACTCCA TAGTCCAGAT GGAGCCTGAA 1080 

AATTATAGTA ATTTAAAACT TAAGGAGAGA TATAAGATAG AAGATGGGGC AAATACAACC 114 0 

ATGGCTCTTA GCAAAGTCAA GTTGAATGAT ACACTCAACA AAGATCAACT TCTGTCATCC 1200 

AGCAAATACA CCATCCAACG GAGCACAGGA GATAGTATTG ATACTCCTAA TTATGATGTG 126 0 

CAGAAACACA TCAATAAGTT ATGTGGCATG TTATTAATCA CAGAAGATGC TAATCATAAA 132 0 

TTCACTGGGT TAATAGGTAT GTTATATGCG ATGTCTAGGT TAGGAAGAGA AGACACCATA 13 8 0 

AAAATACTCA GAGATGCGGG ATATCATGTA AAAGCAAATG GAGTAGATGT AACAACACAT 144 0 

CGTCAAGACA TTAATGGAAA AGAAATGAAA TTTGAAGTGT TAACATTGGC AAGCTTAACA 1500 

ACTGAAATTC AAATCAACAT TGAGATAGAA TCTAGAAAAT CCTACAAAAA AATGCTAAAA 1560 

GAAATGGGAG AGGTAGCTCC AGAATACAGG CATGACTCTC CTGATTGTGG GATGATAATA 162 0 

TTATGTATAG CAGCATTAGT AATAACTAAA TTAGCAGCAG GGGACAGATC TGGTCTTACA 1680 

GCCGTGATTA GGAGAGCTAA TAATGTCCTA AAAAATGAAA TGAAACGTTA CAAAGGCTTA 174 0 

CTACCCAAGG ACATAGCCAA CAGCTTCTAT GAAGTGTTTG AAAAACATCC CCACTTTATA 1800 

GATGTTTTTG TTCATTTTGG TATAGCACAA TCTTCTACCA GAGGTGGCAG TAGAGTTGAA 186 0 

GGGATTTTTG CAGGATTGTT TATGAATGCC TATGGTGCAG GGCAAGTGAT GTTACGGTGG 1920 

GGAGTCTTAG CAAAATCAGT TAAAAATATT ATGTTAGGAC ATGCTAGTGT GCAAGCAGAA 1980 

ATGGAACAAG TTGTTGAGGT TTATGAATAT GCCCAAAAAT TGGGTGGTGA AGCAGGATTC 2 04 0 

TACCATATAT TGAACAACCC AAAAGCATCA TTATTATCTT TGACTCAATT TCCTCACTTC 210 0 

TCCAGTGTAG TATTAGGCAA TGCTGCTGGC CTAGGCATAA TGGGAGAGTA CAGAGGTACA 2160 
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CCGAGGAATC 


AAGATCTATA 


TGATGCAGCA 


AAGGCATATG 


CTGAACAACT 


CAAAGAAAAT 


2220 


GGTGTGATTA 


ACTACAGTGT 


ACTAGACTTG 


ACAGCAGAAG 


AACTAGAGGC 


TATCAAACAT 


2280 


CAGCTTAATC 


CAAAAGATAA 


TGATGTAGAG 


CTTTGAGTTA 


ATAAAAAATG 


GGGCAAATAA 


2340 


ATCATCATGG 


AAAAGTTTGC 


TCCTGAATTC 


CATGGAGAAG 


ATGCAAACAA 


CAGGGCTACT 


2400 


AAATTCCTAG 


AATCAATAAA 


GGGCAAATTC 


ACATCACCCA 


AAGATCCCAA 


GAAAAAAGAT 


2460 


AGTATCATAT 


CTGTCAACTC 


AATAGATATA 


GAAGTAACCA 


AAGAAAGCCC 


TATAACATCA 


2520 


AATTCAACTA 


TTATCAACCC 


AACAAATGAG 


ACAGATGATA 


CTGCAGGGAA 


CAAGCCCAAT 


2560 


TATCAAAGAA 


AACCTCTAGT 


AAGTTTCAAA 


GAAGACCCTA 


CACCAAGTGA 


TAATCCCTTT 


2640 


TCTAAACTAT 


ACAAAGAAAC 


CATAGAAACA 


TTTGATAACA 


ATGAAGAAGA 


ATCCAGCTAT 


2700 


TCATACGAAG 


AAATAAATGA 


TCAGACAAAC 


GATAATATAA 


CAGCAAGATT 


AGATAGGATT 


2760 


GATGAAAAAT 


TAAGTGAAAT 


ACTAGGAATG 


CTTCACACAT 


TAGTAGTGGC 


AAGTGCAGGA 


2820 


CCTACATCTG 


CTCGGGATGG 


TATAAGAGAT 


GCCATGGTTG 


GTTTAAGAGA 


AGAAATGATA 


2880 


GAAAAAATCA 


GAACTGAAGC 


ATTAATGACC 


AATGACAGAT 


TAGAAGCTAT 


GGCAAGACTC 


2940 


AGGAATGAGG 


AAAGTGAAAA 


GATGGCAAAA 


GACACATCAG 


ATGAAGTGTC 


TCTCAATCCA 


3000 


ACATCAGAGA 


AATTGAACAA 


CCTATTGGAA 


GGGAATGATA 


GTGACAATGA 


TCTATCACTT 


3060 


GAAGATTTCT 


GATTAGTTAC 


CAATCTTCAC 


ATCAACACAC 


AATACCAACA 


GAAGACCAAC 


3120 


AAACTAAC CA 


ACCCAATCAT 


CCAACCAAAC 


ATCCATCCGC 


CAATCAGCCA 


AACAGCCAAC 


3180 


AAAACAACCA 


GCCAATCCAA 


AACTAACCAC 


CCGGAAAAAA 


TCTATAATAT 


AGTTACAAAA 


3240 


AAAGGAAAGG 


GTGGGGCAAA 


TATGGAAACA 


TACGTGAACA 


AGCTTCACGA 


AGGCTCCACA 


3300 


T AC AC AG CTG 


CTGTTCAATA 


CAATGTCTTA 


GAAAAAGACG 


ATGACCCTGC 


ATCACTTACA 


3360 




CCATGTTCCA 


ATCATCTATG 


CCAGCAGATT 


TACTTATAAA 


AGAACTAGCT 


3420 


AATGTCAACA 


TACTAGTGAA 


ACAAATATCC 


ACACCCAAGG 


GACCTTCACT 


AAGAGTCATG 


3480 


ATAAACT C AA 


GAAGTGCAGT 


GCTAGCACAA 


ATGCCCAGCA 


AATTTACCAT 


ATGCGCTAAT 


3540 


GTGTCCTTGG 


ATGAAAGAAG 


CAAACTAGCA 


TATGATGTAA 


CCACACCCTG 


TGAAATCAAG 


3600 


GCATGTAGTC 


TAACATGCCT 


AAAATCAAAA 


AATATGTTGA 


CTACAGTTAA 


AGATCTCACT 


3660 


ATGAAGACAC 


TCAACCCTAC 


ACATGATATT 


ATTGCTTTAT 


GTGAATTTGA 


AAACATAGTA 


3720 


ACATCAAAAA 


AAGTCATAAT 


ACCAACATAC 


CTAAGATCCA 


TCAGTGTCAG 


AAATAAAGAT 


3780 


CTGAACACAC 


TTGAAAATAT 


AACAACCACT 


GAATTCAAAA 


ATGCTATCAC 


AAATGCAAAA 


3840 


ATCATCCCTT 


ACTCAGGATT 


AC TAT. A ACa 1 U 






AGGAGCATTC 


3900 


AAATACATAA 


AGCCACAAAG 


TCAATTCATA 


GTAGATCTTG 


GAGCTTACCT 


AGAAAAAGAA 


3960 


AGTATATATT 


ATGTTACCAC 


AAATTGGAAG 


CACACAGCTA 


CACGATTTGC 


AATCAAACCC 


4020 


ATGGAAGATT 


AAccrrrri'c 


CTCTACATCA 


GTGTGTTAAT 


TCATACAAAC 


TTTCTACCTA 


4080 


CATTCTTCAC 


TTCACCATCA 


CAATCACAAA 


CACTCTGTGG 


TTCAACCAAT 


CAAACAAAAC 


4140 


TTATCTGAAG 


TCCCAGATCA 


TCCCAAGTCA 


XTGTTTATCA 


GATCTAGTAC 


TCAAATAAGT 


4200 
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TAATAAAAAA TATACACATG 
TCTGTTAACA TAGACAAGTC 
AACAATAGAA TTCTCAAGCA 
AATAATCTCT TTGCTAATCA 
ATATAACGTA TTCCATAACA 
TTCATCAATC CAACAGCCCA 
CTTTACAACA CCTCATTAAC 
AATTAAAAAT AGTCATAACA 
ATGCAAACAT GTCCAAAAAC 
ACACTCTCAA TCATTTATTA 
TAGCACAAAT CACATTATCC 
CCATCATATT CATAGCCTCG 
ATGCAACAAG CCAGATCAAG 
GAATCAGTCC CTCTAATCCG 
CAACACCAGG AGTCAAGTCA 
CAACTCAAAC ACAACCCAGC 
AACCCAATAA TGATTTTCAC 
ACAATCCAAC CTGCTGGGCT 
CCACTACCAA GCCCACAAAA 
AAACCACTAA ATCAAAGGAA 
CCACCAAAAC AAACATCATA 
TCACAAGTCA AATGGAAACC 
AAGTCTCTAC AACATCCGAG 
AGTAGTTACT TAAAAACATA 
TAAACTCTGG GGCAAATAAC 
ATCCTCACTG CAGTCACATT 
CAATCAACAT GCAGTGCAGT 
ACCAGTGTTA TAACTATAGA 
GCTAAGGTAA AATTGATT^AA 
CAGTTGCTCA TGCAAAGCAC 
TTTATGAATT ATACACTCAA 
AAAAGAAGAT TTCTTGGTTT 
GTATCTAAGG TCCTGCACCT 
ACAAACAAGG CTGTAGTCAG 
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GGGCAAATAA 


TCATTGGAGG 


AAATCCAACT 


AATCACAATA 


4260 


CACACACCAT 


ACAGAATCAA 


CCAATGGAAA 


ATACATCCAT 


4 320 


AATTCTGGCC 


TTACTTTACA 


CTAATACACA 


TGATCACAAC 


4380 


TAATCTCCAT 


CATGATTGCA 


ATACTAAACA 


AACTTTGTGA 


4440 


AAACCTTTGA 


GTTACCAAGA 


GCTCGAGTCA 


ACACATAGCA 


4500 


AAACAGTAAC 


CTTGCATTTA 


AAAATGAACA 


ACCCCTACCT 


4560 


ATCCCACCAT 


GCAAACCACT 


ATCCATACTA 


TAAAGTAGTT 


4620 


ATGAACTAGG 


ATATCAAGAC 


TAACAATAAC 


ATTGGGGCAA 


4680 


AAGGACCAAC 


GCACCGCTAA 


GACATTAGAA 


AGGACCTGGG 


4740 


TTCATATCAT 


CGTGCTTATA 


TAAGTTAAAT 


CTTAAATCTG 


4800 


ATTCTGGCAA 


TGATAATCTC 


AACTTCACTT 


ATAATTGCAG 


4860 


GCAAACCACA 


AAGTCACACC 


AACAACTGCA 


ATCATACAAG 


4920 


AACACAACCC 


CAACATACCT 


CACCCAGAAT 


CCTCAGCTTG 


4980 


TCTGAAATTA 


CATCACAAAT 


CACCACCATA 


CTAGCTTCAA 


5040 


ACCCTGCAAT 


CCACAACAGT 


CAAGACCAAA 


AACACAACAA 


5100 


AAGCCCACCA 


CAAAACAACG 


CCAAAACAAA 


CCACCAAGCA 


5160 


TTTGAAGTGT 


TCAACTTTGT 


ACCCTGCAGC 


ATATGCAGCA 


5220 


ATCTGCAAAA 


GAATACCAAA 


CAAAAAACCA 


GGAAAGAAAA 


5280 


AAACCAACCC 


TCAAGACAAC 


CAAAAAAGAT 


CCCAAACCTC 


5340 


GTACCCACCA 


CCAAGCCCAC 


AGAAGAGCCA 


ACCATCAACA 


5400 


ACTACACTAC 


TCACCTCCAA 


CACCACAGGA 


AATCCAGAAC 


5460 


TTCCACTCAA 


CTTCCTCCGA 


AGGCAATCCA 


AGCCCTTCTC 


5520 


TACCCATCAC 


AACCTTCATC 


TCCACCCAAC 


ACACCACGCC 


5580 


TTATCACAAA 


AGGCCTTGAC 


CAACTTAAAC 


AGAATCAAAA 


5640 


AATGGAGTTG 


CTAATCCTCA 


AAGCAAATGC 


AATTACCACA 


5700 


TTGTTTTGCT 


TCTGGTCAAA 


ACATCACTGA 


AGAATTTTAT 


5760 


TAGCAAAGGC 


TATCTTAGTG 


CTCTGAGAAC 


TGGTTGGTAT 


5820 


ATTAAGTAAT 


ATC?UVGAAAA 


ATAAGTGTAA 


TGGAACAGAT 


5880 


ACAAGAATTA 


GATAAATATA 


AAAATGCTGT 


AACAGAATTG 


5940 


ACAAGCAACA 


AACAATCGAG 


CCAGAAGAGA 


ACTACCAAGG 


6000 


CAATGCCAAA 


AAAACCAATG 


TAACATTAAG 


CAAGAAAAGG 


6060 


TTTGTTAGGT 


GTTGGATCTG 


CAATCGCCAG 


TGGCGTTGCT 


6120 


AGAAGGGGAA 


GTGAACAAGA 


TCAAAAGTGC 


TCTACTATCC 


6180 


CTTATCAAAT 


GGAGTTAGTG 


TTTTAACCAG 


CAAAGTGTTA 


6240 
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GACCTCAAAA 


ACTATATAGA 


TAAACAATTG 


TTACCTATTG 


TGAACAAGCA 


AAGCTGCAGC 


6300 


ATATCAAATA 


TAGAAACTGT 


GATAGAGTTC 


CAACAAAAGA 


ACAACAGACT 


ACTAGAGATT 


6360 


ACCAGGGAAT 


T T AGTGTTAA 


TGCAGGCGTA 


ACTACACCTG 


TAAGCACTTA 


CATGTTAACT 


6420 


A AT AGTGAAT 


TATTGT CATT 


AATCAATGAT 


ATGCCTATAA 


CAAATGATCA 


GAAAAAGTTA 


6480 


liTGTr'PAACA 


ATGTTCAAAT 


AGTTAGACAG 


CAAAGTTACT 


CTATCATGTC 


CATAATAAAA 


6540 


HAGGAAGTCT 


TAGCATATGT 


AGTACAATTA 


CCACTATATG 


GTGTTATAGA 


TACACCCTGT 


6600 


TG G A A A PT AC 


ACACATCCCC 


TCTATGTACA 


ACCAACACAA 


AAGAAGGGTC 


CAACATCTGT 


6660 


TTAACAAGAA 


CTGACAGAGG 


ATGGTACTGT 


GACAATGCAG 


GATCAGTATC 


TTTCTTCCCA 


6720 


PAAGPTGAAA 


CATGTAAAGT 


TCAATCAAAT 


CGAGTATTTT 


GTGACACAAT 


GAACAGTTTA 


6780 


ACATTACCAA 


GTGAAGTAAA 


TCTCTGCAAT 


GTTGACATAT 


TCAACCCCAA 


ATATGATTGT 


6840 


AAAATTATGA 


CTTCAAAAAC 


AGATGTAAGC 


AGCTCCGTTA 


TCACATCTCT 


AGGAGCCATT 


6900 


GTGTC ATG CT 


ATGGCAAAAC 


TAAATGTACA 


GCATCCAATA 


AAAATCGTGG 


AATCATAAAG 


6960 


2i P A 'PTTT PT A 

rt^rt X XX X \«. X n 


ACGGGTGCGA 


TTATGTATCA 


AATAAAGGGG 


TGGACACTGT 


GTCTGTAGGT 


7020 


AACACATTAT 


ATTATGTAAA 


TAAGCAAGAA 


GGTAAAAGTC 


TCTATGTAAA 


AGGTGAACCA 


7080 


ATAATAAATT 
rtX*v^x*v***x X 


TCTATGACCC 


ATTAGTATTC 


CCCTCTGATG 


AATTTGATGC 


ATCAATATCT 


7140 


P 2V AriTP A APG 


AGAAGATTAA 


CCAGAGCCTA 


GCATTTATTC 


GTAAATCCGA 


TGAATTATTA 


7200 


r*ATAATGTAA 


ATGCTGGTAA 


ATCCACCACA 


AATATCATGA 


TAACTACTAT 


AATTATAGTG 


7260 


A 1 X >^ X 1 MM 


T ATTGTT AT C 
X /^x X \j X X \» 


ATTAATTGCT 


GTTGGACTGC 


TCTTATACTG 


TAAGGCCAGA 


7320 




TP A PAPTAAG 


CAAAGATCAA 


CTGAGTGGTA 


TAAATAATAT 


TGCATTTAGT 


7380 


AAl. i AAA I AA 


a anTZiGPAPP 


TAATCATGTT 


CTTACAATGG 


TTTACTATCT 


GCTCATAGAC 


7440 




X X X VJVJ A X X 


TTPTTAAAAT 


CTGAACTTCA 


TCGAAACTCT 


CATCTATAAA 


7500 


/^/^ TV T'/^TT^ a ^■'T* 
U-fJAl C 1 wAv- i 


TaPAPTATTT 

X >Wu.AV> X AAA 


AAGTAGATTC 


CTAGTTTATA 


GTTATATAAA 


ACACAATTGC 


7560 


AluCUAVaAl 1 


A&PTTAPCAT 


CTGTAAAAAT 


GAAAACTGGG 


GCAAATATGT 


CACGAAGGAA 


7620 




TTTGAAATTC 


GAGGTCATTG 


CTTAAATGGT 


AAGAGGTGTC 


ATTTTAGTCA 


7680 


1 AAl 1 AX XXX 


GAATGGCCAC 


CCCATGCACT 


GCTTGTAAGA 


CAAAACTTTA 


TGTTAAACAG 


7740 


a aT*a(^TTaar! 

AA X A\- X X AAu 


T PT ATGG ATA 

X Var X X X #v 


AAAGTATAGA 


TACCTTATCA 


GAAATAAGTG 


GAGCTGCAGA 


7800 


X X ^oAV-mAuA 


A P AG AAGAGT 


ATGCTCTTGG 


TGTAGTTGGA 


GTGCTAGAGA 


GTTATATAGG 


7860 


aTPaaTaa 

n, X \..~rvrl X nn*x\^ 


AATATAACTA 


AACAATCAGC 


ATGTGTTGCC 


ATGAGCAAAC 


TCCTCACTGA 


7920 


ACTCAATAGT 


GATGATATCA 


AAAAGCTGAG 


GGACAATGAA 


VjAuV. X AAA X X 


f^aof^faarsAT 


798 0 


AAGAGTGTAC 


AATACTGTCA 


TATCATATAT 


TGAAAGCAAC 


AGGAAAAACA 


ATAAACAAAC 


8040 


TATCCATCTG 


TTAAAAAGAT 


TGCCAGCAGA 


CGTATTGAAG 


AAAACCATCA 


AAAACACATT 


8100 


GGATATCCAT 


AAGAGCATAA 


CCATCAACAA 


CCCAAAAGAA 


TCAACTGTTA 


GTGATACAAA 


8160 


TGACCATGCC 


AAAAATAATG 


ATACTACCTG 


ACAAATATCC 


TTGTAGTATA 


ACTTCCATAC 


8220 


TAATAACAAG 


TAGATGTAGA 


GTTACTATGT 


ATAATCAAAA 


GAACACACTA 


TATTTCAATC 


8280 
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AAAACAACCC 


AAATAACCAT 


ATGTACTCAC 


CGAATCAAAC 


ATTCAATGAA 


ATCCATTGGA 


8340 


CCTCTCAAGA 


ATTGATTGAC 


ACAATTCAAA 


ATTTTCTACA 


ACATCTAGGT 


ATTATTGAGG 


8400 


ATATATATAC 


AATATATATA 


TTAGTGTCAT 


AACACTCAAT 


TCTAACACTC 


ACCACATCGT 


8460 


TACATTATTA 


ATTCAAACAA 


TTCAAGTTGT 


GGGACAAAAT 


GGATCCCATT 


ATTAATGGAA 


8S20 


ATTCTGCTAA 


TGTTTATCTA 


ACCGATAGTT 


ATTTAAAAGG 


TGTTATCTCT 


TTCTCAGAGT 


8580 


GTAATGCTTT 


AGGAAGTTAC 


ATATTCAATG 


GTCCTTATCT 


CAAAAATGAT 


TATACCAACT 


8640 


TAATTAGTAG 


ACAAAATCCA 


TTAATAGAAC 


ACATGAATCT 


AAAGAAACTA 


AATATAACAC 


8700 


AGTCCTTAAT 


ATCTAAGTAT 


CATAAAGGTG 


AAATAAAATT 


AGAAGAACCT 


ACTTATTTTC 


8760 


AGTCATTACT 


TATGACATAC 


AAGAGTATGA 


CCTCGTCAGA 


ACAGATTGCT 


ACCACTAATT 


8820 


TACTTAAAAA 


GATAATAAGA 


AGAGCTATAG 


AAATAAGTGA 


TGTCAAAGTC 


TATGCTATAT 


8860 


TGAATAAACT 


AGGGCTTAAA 


GAAAAGGACA 


AGATTAAATC 


CAACAATGGA 


CAAGATGAAG 


8940 


ACAACTCAGT 


TATTACGACC 


ATAATCAAAG 


ATGATATACT 


TTCAGCTGTT 


AAAGATAATC 


9000 


AATCTCATCT 


TAAAGCAGAC 


AAAAATCACT 


CTACAAAACA 


AAAAGACACA 


ATCAAAACAA 


9060 


CACTCTTGAA 


GAAATTGATG 


TGTTCAATGC 


AACATCCTCC 


ATCATGGTTA 


ATACATTGGT 


9120 


TTAACTTATA 


CACAAAATTA 


AACAACATAT 


TAACACAGTA 


TCGATCAAAT 


GAGGTAAAAA 


9180 


ACCATGGGTT 


TACATTGATA 


GATAATCAAA 


CTCTTAGTGG 


ATTTCAATTT 


ATTTTGAACC 


9240 


AATATGGTTG 


TATAGTTTAT 


CATAAGGAAC 


TCAAAAGAAT 


TACTGTGACA 


ACCTATAATC 


9300 


AATTCTTGAC 


ATGGAAAGAT 


ATTAGCCTTA 


GTAGATTAAA 


TGTTTGTTTA 


ATTACATGGA 


9360 


TTAGTAACTG 


CTTGAACACA 


TTAAATAAAA 


GCTTAGGCTT 


AAGATGCGGA 


TTCAATAATG 


9420 


TTATCTTGAC 


ACAACTATTC 


CTTTATGGAG 


ATTGTATACT 


AAAGCTATTT 


CACAATGAGG 


9460 


GGTTCTACAT 


AATAAAAGAG 


GTAGAGGGAT 


TTATTATGTC 


TCTAATTTTA 


AATATAACAG 


9540 


AAGAAGATCA 


ATTCAGAAAA 


CGATTTTATA 


ATAGTATGCT 


CAACAACATC 


ACAGATGCTG 


9600 


CTAATAAAGC 


TCAGAAAAAT 


CTGCTATCAA 


GAGTATGTCA 


TACATTATTA 


GATAAGACAG 


9660 


TGTCCGATAA 


TATAATAAAT 


GGCAGATGGA 


TAATTCTATT 


AAGTAAGTTC 


CTTAAATTAA 


9720 


TTAAGCTTGC 


AGGTGACAAT 


AACCTTAACA 


ATCTGAGTGA 


ACTATATTTT 


TTGTTCAGAA 


9780 


TATTTGGACA 


CCCAATGGTA 


GATGAAAGAC 


AAGCCATGGA 


TGCTGTTAAA 


ATTAATTGCA 


9840 


ATGAGACCAA 


ATTTTACTTG 


TTAAGCAGTC 


TGAGTATGTT 


AAGAGGTGCC 


TTTATATATA 


9900 


GAATTATAAA 


AGGGTTTGTA 


AATAATTACA 


ACAGATGGCC 


TACTTTAAGA 


AATGCTATTG 


9960 


TTTTACCCTT 


AAGATGGTTA 


ACTTACTATA 


AACTAAACAC 


TTATCCTTCT 


TTGTTGGAAC 


10020 


TTACAGAAAG 


AGATTTGATT 


GTGTTATCAG 


GACTACGTTT 


CTATCGTGAG 


TTTCGGTTGC 


10080 


CTAAAAAAGT 


GGATCTTGAA 


ATGATTATAA 


ATGATAAAGC 


TATATCACCT 


CCTAAAAATT 


10140 


TGATATGGAC 


TAGTTTCCCT 


AGAAATTACA 


TGCCATCACA 


CATACAAAAC 


TATATAGAAC 


10200 


ATGAAAAATT 


AAAATTTTCC 


GAGAGTGATA 


AATCAAGAAG 


AGTATTAGAG 


TATTATTTAA 


10260 


GAGATAACAA 


ATTCAATGAA 


TGTGATTTAT 


ACAACTGTGT 


AGTTAATCAA 


AGTTATCTCA 


10320 
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ACAACCCTAA 


TCATGTGGTA 


TCATTGACAG 


GCAAAGAAAG 


AGAACTCAGT 


GTAGGTAGAA 


10380 


TGTTTGCAAT 


GCAACCX3GGA 


ATGTTCAGAC 


AGGTTCAAAT 


ATTGGCAGAG 


AAAATGATAG 


10440 


CTGAAAACAT 


TTTACAATTC 


TTTCCTGAAA 


GTCTTACAAG 


ATATGGTGAT 


CTAGAACTAC 


10500 


AAAAAATATT 


AGAACTGAAA 


GCAGGAATAA 


GTAACAAATC 


AAATCGCTAC 


AATGATAATT 


10560 


ACAACAATTA 


CATTAGTAAG 


TGCTCTATCA 


TCACAGATCT 


CAGCAAATTC 


AATCAAGCAT 


10620 


TTCGATATGA 


AACGTCATGT 


ATTTGTAGTG 


ATGTGCTGGA 


TGAACTGCAT 


GGTGTACAAT 


10680 


CTCTATTTTC 


CTGGTTACAT 


TTAACTATTC 


CTCATGTCAC 


AATAATATGC 


ACATATAGGC 


10740 


ATGCACCCCC 


CTATATAGGA 


GATCATATTG 


TAGATCTTAA 


CAATGTAGAT 


GAACAAAGTG 


10800 


GATTATATAG 


ATATCACATG 


GGTGGCATCG 


AAGGGTGGTG 


TCAAAAACTA 


TGGACCATAG 


10860 


AAGCTATATC 


ACTATTGGAT 


CTAATATCTC 


TCAAAGGGAA 


ATTCTCAATT 


ACTGCTTTAA 


10920 


TTAATGGTGA 


CAATCAATCA 


ATAGATATAA 


GCAAACCAAT 


CAGACTCATG 


GAAGGTCAAA 


10980 


CTCATGCTCA 


AGCAGATTAT 


TTGCTAGCAT 


TAAATAGCCT 


TAAATTACTG 


TATAAAGAGT 


11040 


ATGCAGGCAT 


AGGCCACAAA 


TTAAAAGGAA 


CTGAGACTTA 


TATATCACGA 


GATATGCAAT 


11100 


TTATGAGTAA 


AACAATTCAA 


CATAACGGTG 


TATATTACCC 


AGCTAGTATA 


AAGAAAGTCC 


11160 


TAAGAGTGGG 


ACCGTGGATA 


AACACTATAC 


TTGATGATTT 


CAAAGTGAGT 


CTAGAATCTA 


11220 


TAGGTAGTTT 


GACACAAGAA 


TTAGAATATA 


GAGGTGAAAG 


TCTATTATGC 


AGTTTAATAT 


11280 


TTAGAAATGT 


ATGGTTATAT 


AATCAGATTG 


CTCTACAATT 


AAAAAATCAT 


GCATTATGTA 


11340 


ACAATAAACT 


ATATTTGGAC 


ATATTAAAGG 


TTCTGAAACA 


CTTAAAAACC 


Tri'TlTAATC 


11400 


TTGATAATAT 


TGATACAGCA 


TTAACATTGT 


ATATGAATTT 


ACCCATGTTA 


TTTGGTGGTG 


11460 


GTGATCCCAA 


CTTGTTATAT 


CGAAGTTTCT 


ATAGAAGAAC 


TCCTGACTTC 


CTCACAGAGG 


11520 


CTATAGTTCA 


CTCTGTGTTC 


ATACTTAGTT 


ATTATACAAA 


CCATGACTTA 


AAAGATAAAC 


11580 


TTCAAGATCT 


GTCAGATGAT 


AGATTGAATA 


AGTTCTTAAC 


ATGCATAATC 


ACGTTTGACA 


11640 


AAAACCCTAA 


TGCTGAATTC 


GTAACATTGA 


TGAGAGATCC 


TCAAGCTTTA 


GGGTCTGAGA 


11700 


G ACAAG CTAA 


AATTACTAGC 


GAAATCAATA 


GACTGGCAGT 


TACAGAGGTT 


TTGAGTACAG 


11760 


CTCCAAACAA 


AATATTCT C C 


AAAAGTGCAC 


AACATTATAC 


TACTACAGAG 


ATAGATCTAA 


11820 


ATGATATTAT 


GCAAAATATA 


GAACCTACAT 


ATCCTCATGG 


GCTAAGAGTT 


GTTTATGAAA 


11880 


GTTTAC CCTT 


TTATAAAGCA 


GAGAAAATAG 


TAAATCTTAT 


ATCAGGTACA 


AAATCTATAA 


11940 


CTAACATACT 


GGAAAAAACT 


TCTGCCATAG 


ACTTAACAGA 


TATTGATAGA 


GCCACTGAGA 


12000 


TGATGAGGAA 


AAACATAACT 


TTGCTTATAA 


GGATACTTCC 


ATTGGATTGT 


AACAGAGATA 


12 060 


AAAGAGAGAT 


ATTGAGTATG 


GAAAACCTAA 


GTATTACTGA 


ATTAAGCAAA 


TATGTTAGGG 


12120 


AAAGATCTTG 


GTCTTTATCC 


AATATAGTTG 


GTGTTACATC 


ACCCAGTATC 


ATGTATACAA 


12180 


TGGACATCAA 


ATATACTACA 


AGCACTATAT 


CTAGTGGCAT 


AATTATAGAG 


AAATATAATG 


12240 


TTAACAGTTT 


AACACGTGGT 


GAGAGAGGAC 


CCACTAAACC 


ATGGGTTGGT 


TCATCTACAC 


12300 


AAGAGAAAAA 


AACAATGCCA 


GTTTATAATA 


GACAAGTCTT 


AACCAAAAAA 


CAGAGAGATC 


12360 
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AAATAGATCT 


ATTAGCAAAA 


TTGGATTGGG 


TGTATGCATC 


TATAGATAAC 


AAGGATGAAT 


12420 


TCATGGAAGA 


ACTCAGCATA 


GGAACCCTTG 


GGTTAACATA 


TGAAAAGGCC 


AAGAAATTAT 


12480 


TTCCACAATA 


TTTAAGTGTC 


AATTATTTGC 


ATCGCCTTAC 


AGTCAGTAGT 


AGACCATGTG 


12540 


AATTCCCTGC 


ATCAATACCA 


GCTTATAGAA 


CAACAAATTA 


TCACTTTGAC 


ACTAGCCCTA 


12600 


TTAATCGCAT 


ATTAACAGAA 


AAGTATGGTG 


ATGAAGATAT 


TGACATAGTA 


TTCCAAAACT 


12660 


GTATAAGCTT 


TGGCCTTAGT 


TTAATGTCAG 


TAGTAGAACA 


ATTTACTAAT 


GTATGTCCTA 


12720 


ACAGAATTAT 


TCTCATACCT 


AAGCTTAATG 


AGATACATTT 


GATGAAACCT 


CCCATATTCA 


12780 


CAGGTGATGT 


TGATATTCAC 


AAGTTAAAAC 


AAGTGATACA 


AAAACAGCAT 


ATGTTTTTAC 


12840 


CAGACAAAAT 


AAGTTTGACT 


CAATATGTGG 


AATTATTCTT 


AAGTAATAAA 


ACACTCAAAT 


12900 


CTGGATCTCA 


TGTTAATTCT 


AATTTAATAT 


TGGCACATAA 


AATATCTGAC 


TATTTTCATA 


12960 


ATACTTACAT 


TTTAAGTACT 


AATTTAGCTG 


GACATTGGAT 


TCTGATTATA 


CAACTTATGA 


13020 


AAGATTCTAA 


AGGTATrriT 


GAAAAAGATT 


GGGGAGAGGG 


ATATATAACT 


GATCATATGT 


13080 


TTATTAATTT 


GAAAGTTTTC 


TTCAATGCTT 


ATAAGACCTA 


TCTCTTGTGT 


TTTCATAAAG 


13140 


GTTATGGCAA 


AGCAAAGCTG 


GAGTGTGATA 


TGAACACTTC 


AGATCTTCTA 


TGTGTATTGG 


13200 


AATTAATAGA 


CAGTAGTTAT 


TGGAAGTCTA 


TGTCTAAGGT 


ATTTTTAGAA 


CAAAAAGTTA 


13260 


TCAAATACAT 


TCTTAGCCAA 


GATGCAAGTT 


TACATAGAGT 


AAAAGGATGT 


CATAGCTTCA 


13320 


AATTATGGTT 


TCTTAAACGT 


CTTAATGTAG 


CAGAATTCAC 


AGTTTGCCCT 


TGGGTTGTTA 


13380 


ACATAGATTA 


TCATCCAACA 


CATATGAAAG 


CAATATTAAC 


TTATATAGAT 


CTTGTTAGAA 


13440 


TGGGATTGAT 


AAATATAGAT 


AGAATACACA 


TTAAAAATAA 


ACACAAATTC 


AATGATGAAT 


13500 


TTTATACTTC 


TAATCTCTTC 


TACATTAATT 


ATAACTTCTC 


AGATAATACT 


CATCTATTAA 


13560 


CTAAACATAT 


AAGGATTGCT 


AATTCTGAAT 


TAGAAAATAA 


TTACAACAAA 


TTATATCATC 


13620 


CTACACCAGA 


AACCCTAGAG 


AATATACTAG 


CCAATCCGAT 


TAAAAGTAAT 


GACAAAAAGA 


13680 


CACTGAATGA 


CTATTGTATA 


GGTAAAAATG 


TTGACTCAAT 


AATGTTACCA 


TTGTTATCTA 


13740 


ATAAGAAGCT 


TATTAAATCG 


TCTGCAATGA 


TTAGAACCAA 


TTACAGCAAA 


CAAGATTTGT 


13800 


ATAATTTATT 


CCCTATGGTT 


GTGATTGATA 


GAATTATAGA 


TCATTCAGGC 


AATACAGCCA 


13860 


AATCCAACCA 


ACTTTACACT 


ACTACTTCCC 


ACCAAATATC 


CTTAGTGCAC 


AATAGCACAT 


13920 


CACTTTACTG 


CATGCTTCCT 


TGGCATCATA 


TTAATAGATT 


CAATTTTGTA 


TTTAGTTCTA 


13980 


CAGGTTGTAA 


AATTAGTATA 


GAGTATATTT 


TAAAAGATCT 


TAAAATTAAA 


GATCCCAATT 


14040 


GTATAGCATT 


CATAGGTGAA 


GGAGCAGGGA 


ATTTATTATT 


GCGTACAGTA 


GTGGAACTTC 


14100 


ATCCTGACAT 


AAGATATATT 


TACAGAAGTC 


TGAAAGATTG 


CAATGATCAT 


AGTTTACCTA 


14160 


TTGAGTTTTT 


AAGGCTGTAC 


AATGGACATA 


TCAACATTGA 


TTATGGTGAA 


AATTTGACCA 


14220 


TTC CTGCTAC 


AGATGCAACC 


AACAACATTC 


ATTGGTCTTA 


TTTACATATA 


AAGTTTGCTG 


14280 


AACCTATCAG 


TCTTTTTGTC 


TGTGATGCCG 


AATTGTCTGT 


AACAGTCAAC 


TGGAGTAAAA 


14340 


TTATAATAGA 


ATGGAGCAAG 


CATGTAAGAA 


AGTGCAAGTA 


CTGTTCCTCA 


GTTAATAAAT 


14400 
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GTATGTTAAT AGTAAAATAT CATGCTCAAG ATGATATTGA TTTCAAATTA GACAATATAA 144 60 

CTATATTAAA AACTTATGTA TGCTTAGGCA GTAAGTTAAA GGGATCGGAG GTTTACTTAG 14 520 

TCCTTACAAT AGGTCCTGCG AATATATTCC CAGTATTTAA TGTAGTACAA AATGCTAAAT 14 580 

TGATACTATC AAGAACCAAA AATTTCATCA TGCCTAAGAA AGCTGATAAA GAGTCTATTG 14 64 0 

ATGCAAATAT TAAAAGTTTG ATACCCTTTC TTTGTTACCC TATAACAAAA AAAGGAATTA 14700 

ATACTGCATT GTCAAAACTA AAGAGTGTTG TTAGTGGAGA TATACTATCA TATTCTATAG 14 760 

CTGGACGTAA TGAAGTTTTC AGCAATAAAC TTATAAATCA TAAGCATATG AACATCTTAA 14 820 

AATGGTTCAA TCATGTTTTA AATTTCAGAT CAACAGAACT AAACTATAAC CATTTATATA 14880 

TGGTAGAATC TACATATCCT TACCTAAGTG AATTGTTAAA CAGCTTGACA ACCAATGAAC 14940 

TTAAAAAACT GATTAAAATC ACAGGTAGTC TGTTATACAA CTTTCATAAT GAATAATGAA 15000 

TAAAGATCTT ATAATAAAAA TTCCCATAGC TATACACTAA CACTGTATTC AATTATAGTT 15060 

ATTAAAAATT AAAAATCATA TAATTTTTTA AATAACTTTT AGTGAACTAA TCCTAAAGTT 15120 

ATCATTTTAA TCTTGGAGGA ATAAATTTAA ACCCTAATCT AATTGGTTTA TATGTGTATT 15180 

AACTAAATTA CGAGATATTA GTTTTTGACA CTTTTTTTCT CGT 15223 
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1. An isolated infectious RSV particle which 
comprises a recombinant RSV genome or antigenome, a 

5 nucleocapsid (N) protein, a nucleocapsid phosphoprotein (P) , a 
large (L) polymerase protein, and an RNA polymerase elongation 
factor. 

2. The isolated infectious RSV particle of claim 1, 
10 wherein the RNA polymerase elongation factor is M2 (ORFl) 

protein of RSV. 

3. The isolated infectious RSV particle of claim 1, 
which is a subviral particle. 

15 

4. The isolated infectious RSV particle of claim 1, 
which is a virus. 

5. The isolated infectious RSV particle of claim 3, 
20 wherein the particle comprises an RSV antigenome. 

6. The isolated infectious RSV particle of claim 1, 
wherein the particle comprises a recombinant RSV antigenome. 

25 7. The isolated infectious RSV particle of claim 3, 

wherein the particle comprises an RSV genome. 

8. The isolated infectious RSV particle of claim 1, 
wherein the particle comprises a recombinant RSV genome. 



30 



9. The isolated infectious RSV virus of claim 1, 
which is a human RSV. 



35 



10. The isolated infectious RSV virus of claim 1, 
which is a bovine or murine RSV. 



wo 97/12032 . PCT/US96/15524 

53 

11. The isolated infectious RSV of claim 1, having 
a genome or antigenome that is a chimera of two or more 
different RSV genomes. 

5 12. The isolated infectious RSV virus of claim 11, 

wherein the chimeric genome or antigenome comprises nucleotide 
sequences from human and bovine RSV. 

13. A method for producing an infectious RSV 

10 particle from one or more isolated polynucleotide molecules 
encoding said RSV, comprising: 

coexpressing in a cell or cell- free lysate an 
expression vector which comprises an isolated polynucleotide 
molecule encoding a RSV genome or antigenome and an expression 

15 vector which comprises one or more isolated polynucleotide 

molecules that encodes N, P, L and RNA polymerase elongation 
factor proteins, thereby producing an infectious RSV particle. 

14. The method of claim 13, wherein the RSV genome 
20 or antigenome and the N, P, L and RNA polymerase elongation 

factor proteins are expressed by the same expression vector. 

15. The method of claim 13, wherein the expression 
vector encoding the RSV genome or antigenome and the 

25 expression vector encoding the N, P, L and RNA polymerase 
elongation factor proteins are different. 

16. The method of claim 13, wherein the N, P, L and 
RNA polymerase elongation factor proteins are encoded on two 

30 or more different expression vectors. 

17. The method of claim 16, wherein the N, P, L and 
RNA polymerase elongation factor proteins are each encoded on 
different expression vectors. 

35 
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18. The method of claim 13, wherein the isolated 
polynucleotide molecule that encodes a RSV genome or 
ant i genome is cDNA. 

5 19. The method of claim 13, wherein the infectious 

RSV particle is a virus. 

20. The method of claim 19, wherein the 
polynucleotide molecule encoding a RSV genome or antigenome is 

10 from a human, bovine or murine RSV sequence. 

21. The method of claim 20, wherein the 
polynucleotide molecule encoding a RSV genome or antigenome is 
a chimera of a human RSV strain sequence and at least one non- 

15 human RSV sequence. 

22. The method of claim 13, wherein the 
polynucleotide molecule encoding the RSV genome or antigenome 
encodes the sequence of a wild-type RSV strain. 

20 

23. The method of claim 13, wherein the 
polynucleotide molecule encodes an RSV genome or antigenome 
that has been modified from a wild-type RSV strain by a 
nucleotide insertion, rearrangement, deletion or substitution. 

25 

24. The method of claim 23, wherein the 
modification encodes a phenotypic alteration. 

25. The method of claim 24, wherein the 

30 polynucleotide molecule encodes a genome or antigenome of a 
nonhuman RSV . 
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26. The method of claim 24, wherein the phenotypic 
alteration results in attenuation, temperature-sensitivity, 
cold-adaptation, small plaque size or host range restriction. 
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27. The method of claim 23, wherein the 
polynucleotide encodes a genome or antigenome of a nonhuman 
RSV virus or is a chimera of a nonhuman RSV and at least one 
other RSV or human or nonhuman origin. 

5 

28. The method of claim 13, wherein the 
polynucleotide molecule encodes an RSV genome or antigenome of 
a RSV human vaccine strain that has been modified by a 
nucleotide insertion, deletion or substitution. 

10 

29. The method of claim 28, wherein the 
modification encodes a phenotypic alteration. 

30. The method of claim 29, wherein the phenotypic 
15 alteration results in attenuation, temperature-sensitivity, 

cold-adaptation, small plaque size or host range restriction, 

31. The method of claim 29, wherein the phenotypic 
alteration is a change in an immunogenic epitope of RSV. 

20 

32. The method of claim 23, wherein the 
polynucleotide molecule that encodes an RSV genome or 
antigenome of an RSV strain has been modified by inserting a 
nucleotide sequence that encodes a cytokine or a T-helper 

25. epitope. 

33. The method of claim 23, wherein the 
polynucleotide molecule that encodes an RSV genome or 
antigenome of an RSV strain has been modified by inserting a 

30 nucleotide sequence encoding a restriction site marker. 

34. The method of claim 23, wherein the 
polynucleotide molecule that encodes an RSV genome or 
antigenome of an RSV strain has been modified by inserting a 

3 5 nucleotide sequence encoding a G protein of an RSV subgroup 
different from that of said RSV strain. 
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35. The method of claim 23, wherein the 
polynucleotide molecule that encodes an RSV genome or 
antigenome of an RSV strain has been modified by inserting a 
nucleotide sequence encoding a protein of a microbial pathogen 

5 capable of eliciting a protective immune response. 

36. The method of claim 13, wherein at least one of 
the viral proteins is supplied by coinfection with RSV. 

10 37. A cell or cell-free lysate containing an 

expression vector which comprises an isolated polynucleotide 
molecule encoding a RSV genome or antigenome and an expression 
vector which comprises one or more isolated polynucleotide 
molecules that encodes P, L and RNA polymerase elongation 

15 factor proteins of RSV, whereby upon expression said RSV 
genome or antigenome and N, P, L, and RNA polymerase 
elongation factor proteins combine to produce an infectious 
RSV particle . 

20 38. The cell or lysate of claim 37, wherein the RSV 

genome or antigenome and the P, L and RNA polymerase 
elongation factor proteins are encoded by the same expression 
vector. 

25 39. The cell or lysate of claim 37, wherein the 

expression vector encoding the RSV genome or antigenome and 
the expression vector encoding the N, P, L and RNA polymerase 
elongation factor proteins are different. * 

30 40. The cell or lysate of claim 37, wherein the N, 

P, L and RNA polymerase elongation factor proteins are encoded 
on two or more expression vectors. 

41. The cell or lysate of claim 40, wherein the N, 
35 P, L and RNA polymerase elongation factor proteins are each 
encoded on different expression vectors. 
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42. The cell or lysate of claim 31, wherein the 
infectious RSV particle is a virus. 

43. The cell or lysate of claim 37, wherein the 

s polynucleotide molecule encoding a RSV genome or antigenome is 
a human, bovine or murine RSV sequence- 

44. An isolated polynucleotide molecule which 
comprises an operably linked transcriptional promoter, a 

10 polynucleotide sequence encoding an RSV genome or antigenome, 
and a transcriptional terminator, 

45. The isolated polynucleotide molecule of claim 

44, wherein the polynucleotide sequence encoding an RSV genome 
15 or antigenome is a human RSV sequence. 

46. The isolated polynucleotide molecule of claim 

45, wherein the polynucleotide sequence encoding a human RSV 
genome or antigenome is SEQ ID N0:1. 



20 



47, The isolated polynucleotide molecule of claim 
44, wherein the polynucleotide encodes a genome or antigenome 
of a nonhuman RSV virus or is a chimera of a nonhuman RSV and 
at least one other RSV of human or nonhuman origin. 



25 
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